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SYMBOLS 


Most symbols are defined and used ad hoc in the text. Script capitals 
are linear vector spaces, capitals are matrices, except where otherwise 
indicated, and vectors are indicated by the superscript (“") or defined as 
such. Other symbols appearing routinely are listed below. 

( ) , ( ) , ( inverse, pseudo inverse, and transpose of ( ) 

0,0 estimate and error in estimate of ( ) 

= equivalence symbol 

n intersection of two sets 


® direct sum of two linear vector spaces 

£{ } linear vector space spanned by contents of { } 

Where a matrix appears, the column space of the matrix 
indicated. 


h 

K 

it 

H 

H m ,H u 

MS 

m(X) 

P 5 Po> p a 

q 

Q 

(H,Q) 

t 


vector (costate) defining a parameter as a linear 
combination of the state variables 

parameter (costate) space 

observable and unobservable subspaces of K 

matrix whose columns are the costates of a set of 
measurements 

matrices whose columns are bases of 

mean square value 

data type, given as a scalar function of the state 

posterior, prior, and initial covariances of the state 
estimation error 

MS measurement noise for a measurement 

diagonal matrix of MS measurement noise for a set of 
measur ements 

measurement sequence (the costates and noise matrix of a 
set of measurements) 

time 


U t) 


state vector 



{XA(t),t 0 <t<tp} 

x(t) 

X 

Xm>X u 

x m ,x u 

w 


W,W 0 ,W A 


y 


reference solution of the state equation 
state deviation vector, X(t) - X^ft) 
state space 

observable and unobservable subspaces of X 
matrices whose columns are bases of X m ,X u 
transformed state deviation vector 
posterior, prior, and initial covariances of w 
data vector for a set of measurements 
transition matrix of the linear state equation 
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EQUIVALENT MEASUREMENTS, OBSERVABILITY, AND CALIBRATION 
ACCURACY LIMITS IN LINEAR ESTIMATION 


Luigi S. Cicolani 
Ames Research Center 


SUMMARY 


Some aspects of the structure of the linear estimation problem are 
investigated in this report. In the first section, the processing of arbi- 
trary sets of measurements is considered. The notion of equivalent sets of 
measurements is defined and applied to obtain a unified view of such opera- 
tions as data compression, performance analysis, and selective processing of 
data. 


In the second section, the measurements to be processsed are assumed to 
be samplings, obtained at some arbitrary sequence of times, of one or more 
given data types or time-varying functions of the unknowns. The properties of 
the data types are then common to any sampling. One such property, observ- 
ability, is well known. The system is observable to the data types, provided 
all unknowns can be determined from some sampling of the data types. Other- 
wise, only some lesser number of linear combinations of the unknowns can be 
determined and the estimation problem can be reduced to one of estimating this 
smaller set of unknowns. 

Finally, the physical nature of measurements and estimation is considered 
in combination with observability analysis to obtain a theory for the calibra- 
tion accuracy limits inherent in estimation problems. All measurements are 
comparisons of like physical phenomena and the basis of comparison is made 
unique by international definition of physical units. The estimation process 
combines measurements in order to compare some physical system, as defined by 
a set of unknown state variables, with these physical units. Measurements 
separate into comparisons with physical units (calibrations) and comparisons 
among unknowns (hence, are nonlinear in the unknowns) . If all calibrations are 
carried out in an initial set of measurements, as is usually the case, then 
the processing of subsequent measurements becomes a comparison of the system 
with a set of intermediate physical standards. These standards appear as 
unobservable parameters in the problem and the accuracy to which they are 
estimated from the initial set of measurements is a lower bound, or calibra- 
tion accuracy limit, on the accuracy to which the system can be estimated 
without performing new calibrations. Intermediate standards for quantities 
having composite units, such as velocities, can also be defined from the 
elementary standards, and these are applied to examine the estimation of the 
speed of light as part of space vehicle orbit estimation problems. 



INTRODUCTION 


Linear estimation is the problem of computing the best estimate of a set 
of unknowns from noisy measurements of linear functions of the unknowns. This 
subject has received wide attention in the engineering literature in recent 
years and has applications in many fields. A particularly complex example is 
the estimation of space vehicle orbits where accuracy requirements have led 
to the development of radar station networks capable of generating vast 
amounts of data and of sophisticated data processing programs which estimate 
many unknowns. In these and other large-order estimation problems, physical 
insight is difficult to achieve and there is little theory of a general or 
routine nature for analyzing the relations among the basic factors in the 
problem and the estimation performance, or for analyzing the causes of such 
common difficulties as ill-conditioning of the information provided by the 
measurements. Such difficulties are often characteristic of the measurements 
themselves, but with the exception of observability (ref. 1), few measurement 
characteristics of a general nature have been identified and observability has 
itself received limited attention in practice. 

This report is concerned with the analysis of measurements and the 
relation of measurement characteristics to the resulting estimation perfor- 
mance. Past work on such relations has been largely empirical or restricted 
to particular problems. For example, in orbit estimation problems one can 
simulate the estimation problem and then calculate the effect of various types 
of measurements, measurement schedules, orbit geometry, etc., on performance 
(e.g., refs. 2, 3). In several problems, analysis of the equations relating 
the data to properly selected orbit parameters has clarified some aspects of 
performance and computational procedure for these cases (refs. 4, 5, 6). 

The analysis is carried out in the context of the linear estimation 
problem, including assumptions of unforced dynamics and uncorrelated gaussian 
measurement noise. The first section deals with arbitrary sequences of 
measurements. It is found that a group of "equivalent sequences" can be 
derived from the original set of measurements, each of which gives the same 
state estimate and performance. Equivalent sequences permit a number of 
operations on the data and the processing equations; the measurements can be 
compressed to an equivalent sequence of minimum size, certain equivalent 
sequences give the performance, and others permit selective processing of 
information that can be used to separate the estimation calculations into 
independent lower order parts by selectively deleting some information. 

The next section deals with characteristics of the data type. 

Measurement sequences are not always arbitrary sets of measurements but are 
often obtained by sampling one or more data types at some arbitrary sequence 
of times. The data type, given as a function of the state, is then a con- 
straint on the measurements, and its characteristics influence estimation 
performance independent of the particular sample taken and processed. One 
such characteristic is observability; that is, the ability to determine the 
state from some (non-noisy) sampling of the data type (refs. 1, 7, 8). If the 
state is indeterminate from the data type, then unobservable states can be 
defined but a part of the state cannot be estimated from any sampling of the 
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data type. In that case, the order of the data processing equations can be 
reduced by the number of independent unobservable states, and a lower bound on 
the estimation accuracy obtainable from the data type is given by the initial 
estimation errors associated with the unobservable states. 

In the third section, a theory of calibration accuracy in estimation 
problems is obtained as an application of observability analysis. Estimation 
is an exercise in comparing a physical system to internationally defined unit 
quantities (meter, kilogram, etc.). In most problems direct comparisons with 
these unit quantities are carried out in an initial set of measurements and 
the data types are comparisons among functions of the unknowns. The estima- 
tion procedure can then be shown to be one of comparing the physical system 
to intermediate standards of length, mass, etc. These standards are 
unobservable parameters in the problem and their initial estimation accuracies 
provide lower bounds on the accuracy achievable by processing any sampling of 
the data types. An intermediate standard for velocities derived from the 
basic standards is applied to examine estimation of the speed of light as 
part of the orbit estimation problem. 


BASIC FORMULAS FOR THE LINEAR ESTIMATION PROBLEM 


The well-known state estimation procedure reviewed below is based on the 
assumptions that (1) the state is governed by unforced dynamics which are 
either linear or can be described by equations linearized about a reference 
state, and (2) finite sequences of measurements are made, subject to uncorre- 
lated gaussian-distributed noise with zero mean, and processed to obtain the 
minimum variance estimate of the state. 

The state (an n x 1 vector) of a linear system satisfies an equation of 
the form 


x(t) = F (t)x (t) (1) 

Here, F(t) is a known continuous n x n matrix. These are unforced dynamics 
so that the time-history of the state is uniquely defined by the initial 
conditions x(t Q ), as is expressed in the usual solution of (1) 

x(t) = $(t,t 0 )x(t 0 ) (2) 

where $(t,t 0 ) is the transition matrix of the system. 

A scalar output of the system is available for sampling during some time 
period, [t 0 ,tp], and is assumed given in the form 

m(t) = u(t) T x(t) t Q < t < t F (3a) 

or, alternatively 

m C t ) = h T (t)x(t 0 ) t 0 < t < tp (3b) 

where 

h(t) = 3> T (t,t 0 )y (t) 

3 



The vector function (h(t), t 0 < t < tp}, termed the measurement costate 
hereafter, is known and continuous, and it defines the output as a linear 
combination of the components of the initial state. The state dynamics are 
absorbed in the definition of h(t) . If more than one scalar output is 
available, then additional relations of the form (3) are given. 

The data to be processed in order to estimate the state are a (noisy) 
sampling of the output at some finite sequence of times, tj, . . ., tpj, in 
[t 0 , tp] . These data are related to the state by 

Yi = h T (t i )x(t 0 ) + e i i = 1, . . ., N (4) 

where {y^} are the actual data and is the measurement noise, assumed to 

be a gaussian distributed random variable with zero mean and known nonzero 
MS value, q(t^). The sample times are assumed sufficiently separated for 
the noise to be uncorrelated among the measurements. The notation for the 
costates, noise variances, and data from any set of measurements is abbrevi- 
ated to 



It will also be convenient to refer to the two matrices, (H, Q) , as a 
measurement sequence . In the first section of this report, the costates of a 
measurement sequence, H, are taken as arbitrary rather than constrained to 
samplings of a given output function of the form (3) . The second section 
considers the effects of introducing such a constraint. 

This report deals with the processing of measurements with the sequential 
minimum variance equation (ref. 9) . Prior measurements are assumed to have 
been processed and to have yielded the estimate, x , and covariance matrix 

P 0 = E[x“(x") t ] 


where x is the estimation error x - x. The new estimate and covariance 
obtained from processing (H, Q, y) are then 

x = P 0 H[Q + H T P 0 H] _1 (y - H t x~) + x~ 

P = [Pq 1 + HQ - 1 H T ] = P D - P 0 H[Q + HTp o H] _1 HTp o 

The source of the prior estimate and covariance is unspecified here but it is 
assumed that P Q is positive definite. 
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In many problems as, for example, the orbit determination problem, the 
linear system, (1) - (4), is obtained as an approximation of a nonlinear one. 
Since consideration is given to the calibration accuracy limits in such 
linearized systems in a later section, a brief review of the linearization 
procedure is given here. 


A nonlinear state equation and scalar output are assumed given in the 

form 


x(t) = f (x (t)) 

(6a) 

m = m(X) 

(6b) 


The output, m (X) , and the components of f(X) are assumed defined on the state 
space and to have continuous partial derivatives. If more than one output is 
available for sampling, additional functions of the form (6b) are given. The 
state, X, is an n x 1 vector list of independent quantities which suffice to 
define the system. Independence of the state variables means there is no 
nontrivial function, g(X(t)), that is zero at all times. 

To linearize equations (6), first obtain a reference solution of (6a), 
denoted by 

(X A (t), t Q < t < t F } (7) 


This solution is assumed determined as an unbiased estimate of X(t Q ) from 
some set of initial measurements which are arbitrary (not necessarily sam- 
plings of (6b)) and sufficient (they can be inverted to solve for X(t Q )). 
Define the error in this initial estimate 

x(t 0 ) = X(t G ) ~ X^(t 0 ) 

where X(t Q ) is the unknown true state at t 0 ; then 

E[x(t 0 )] = 0 (7a) 

and the initial covariance 

P A = E[x(t 0 )x T (t 0 )] (7b) 


is a (known) positive definite matrix. 

To complete the linearization, define the state deviation 

x(t) = X(t) - X A (t) to < t < tp 
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The state deviation is assumed sufficiently "small" so that it satisfies the 
linearized state equation (1), with F(t) given by the gradient of (6a) with 
respect to the state variables 


F (t) 



1 , 



( 8 ) 


Here, the subscripts, i, j, denote the components of the vectors f(X) and X, 
respectively. Further, the linearized output (eq. (3)) is obtained from the 
gradient of equation (6b) by using 


p(t) = [Vm(X) ] x ^(t) ( 9 ) 

Equation (3) then gives the first-order difference between the output for the 
true state and the output for the reference state. 

Finally, the estimation procedure is carried out by processing samplings 
of the output with equation (5). The processing is begun with the initial 
state estimate and covariance given by equations (7a) and (7b) . These sam- 
plings can be processed in equation (5) in any convenient order - sequentially, 
in groups, or all at once. If the processing is done sequentially, then x” 
and P 0 in equation (5) refer to the results of processing measurements prior 
to the current one. 


EQUIVALENT SEQUENCES AND DATA 


This section deals with the analysis of measurement sequences and 
estimation performance, that is, of equation (5). The restriction of measure- 
ment sequences to samplings of one or more data types is not imposed so that 
the discussion applies to arbitrary finite sequences. The principal tool in 
the analysis is the equivalent sequence and data, that is, any set of measure- 
ments and data (possibly fictitious) which, when processed, give the same 
state estimate and covariance as the real sequence and data. The equivalent 
sequence and data is used in the analysis to compress large sequences into 
small equivalent sequences, and to obtain equivalent sequences with special 
properties such as those which state the estimation performance and those 
which extract information on any specified set of parameters. 

The basic quantities used in this section are defined next. A parameter 
is any linear combination of the state variables, h^x, and is specified by a 
vector (costate), h, whose components are the constants of combination. If 
(H, Q, y) is any finite measurement sequence and the corresponding data vector, 
then the columns of H specify the parameters measured by the sequence and 
the column space of H is called the observable parameter space expressed as 

^ = JE{H) 
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The information matrix for a sequence is 


I f = hq _1 hT 


A fictitious sequence, (Hg, Qg) , is said to be equivalent to the real sequence, 
(H, Q) , if they have identical information matrices. That is, if 

HgQj^Hg = HQ -1 H T 


The noise matrix, Q £ , is assumed diagonal here. Both sequences give the same 
covariance when processed in equation (5) . 

If (Hg, Q E ) is equivalent to the real sequence (H, Q) and y is the real 
data, then the corresponding equivalent data, y E , is that fictitious data 
vector such that (Hg, Qg, y E ) yields the same estimate of the state in 
equation (5) as does (H, Q, y) . 


Equivalent Sequences 

Any real sequence can be replaced by a variety of equivalent sequences, 
all of which have identical information. For example, the measurement noise 
can always be absorbed in the h vectors by defining 

H* = HQ _1/2 


Then (H f , I) is equivalent to (H, Q) . Such sequences with unit mean square 
noise are called primary. It is apparent that any square root 1 of the 
information matrix is an equivalent primary sequence. 

Sequences equivalent to (H, Q) can consist of r measurements, where r 
can be greater or less than the number of measurements in (H, Q) . There is 
no upper limit, but a lower limit r* exists and is 

r* = Rank(H) (10) 

An equivalent sequence containing r* measurements is called an equivalent 
basis since its costates are a basis of the column space of H. 

A series of statements concerning the algebra of equivalent sequences 
follows next. Some minor proofs are omitted. The notation (H^, Q A ) = (Hg, Qg) 
means the two sequences are equivalent. Where primary sequences are involved, 
the noise matrix is dropped from this notation. 


^he square root of a positive semidefinite matrix, M, refers to any 
matrix such that = M. 
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[1] Equivalent sequences span the same space. If H a = H then 

f{H A } = £{%}. B 

[2] Let [H a • H g] be a partitioned primary sequence. If H A = H c then 
[H A lH B ] = [H C S H b ] . 

[3] Suppose [H A jH B ] = [H^’Hp]. Then H A = Hq if and only if H B = Hp. 
These statements are also true for nonprimary sequences. 

[4] Every sequence has an equivalent basis. 

Note that the information matrix is positive semidefinite. Any square 
root of Ip which has maximal rank is an equivalent primary basis . The 
mathematical literature may be consulted for various methods of computing 
square roots. One interesting method is described in reference 10 and is 
reviewed in appendix A. 

[5] Let H A be any equivalent primary basis of (H, Q) . Then Hg is 
also an equivalent primary basis if and only if there exists an orthogonal 
r* x r* matrix, B, such that 


h a = H B B 

To prove [5], assume, first, that H A is an equivalent primary basis of 
(H, Q) and is related to H B by H A = H B B where B is orthogonal. Then 
Ha, H b are equivalent since they yield the same information matrix, whence 
H b is also equivalent to (H, Q) . Conversely, assume H A , H B , are equivalent 
primary bases of (H, Q) . Then both are bases of the same subspace, and 
are related by some nonsingular matrix, B, of size r*; H A = H B B. Since H A 
and H B are equivalent, it follows that BB T is the unit matrix, whence B 
is orthogonal. 

[6] Let H A , H* be any pair of primary equivalent bases of some 
sequence (H, Q) , and let Hg, Hn be any subsequences of H A , H A , respectively. 
Then Hg = Hg if and only if T{H B } = £{Hg}. 

f 

To prove [6], note first, that if Hg = Hg then they span the t same space, 
(statement [1]). Conversely, suppose f{Hg} = <C{Hg}. Since H A , H A are 

equivalent primary bases, then (statement [5]) they are related by an 
orthogonal matrix, B. 


H A = h a b 


This equation can be partitioned as 


[Hb|Hr] = [%•%] 


B 1 ! B 2 
B 3 ! B 4 
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or 


Hb = H b B! + H R B 3 

Since Hg, Hg are both bases of the same space then B 3 is zero. Since B 
was orthogonal then, B} is orthogonal and Hg, Hg are equivalent 
(statement [5]). 

As seen in [5], many equivalent basis sequences are available and 
several with special properties are discussed in later sections. Basis 
sequences compress the original set of measurements to r* measurements 
which can be used in the data processing in place of the real sequence. 


Equivalent Data 

Suppose (Hg, Qg) is any sequence equivalent to the real sequence (H, Q) . 
The corresponding equivalent data vector, yg, is that fictitious data vector 
for which the state estimate, x, obtained from processing (Hg, Qg, y £ ) in 
equation (5) , is identical to the estimate obtained from (H, Q, y) . 

The equivalent data are more easily derived from an alternate formula for 
the state estimate. Let 


H t = [H 0 j H] , 




that is, (H T , Q-p, y T ) is the total set of all prior and current measurements. 
The minimum variance estimate of the state from the total set of 
measurements is (ref. 9) 


[H T Q^ 1 H^]' 1 H T Q T 1 y T 


This estimate is identical to that given by equation (5) . The vector yg is 
defined by the condition that the state estimate be unchanged_when (H, Q, y) 
is replaced by (Hg, Qg, yg) in the above formula. Since H-gQ^HT is the 

total information matrix, which is nonsingular and which is unchanged if any 
subsequence is replaced by an equivalent subsequence (statement [2]), then 
y^ satisfies 


[H 0 ;H E ] 


_° ! q e _ 
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or, after expansion, 

H E QeV e = hq'V 

If (H E , Q e ) is an equivalent basis, then there is a unique solution for y £ 
since every column of H can then be given as a unique linear combination of 
the columns of H E ; that is, H can be given as 


H = H e M (11) 

T 

Substitute this in the preceding equation and premultiply both sides by HgP o 
to obtain 

X E = Q e MQ"V (12a) 

valid for equivalent basis sequences. The result depends only on the real 
sequence and data and is independent of prior measurements. Alternatively, 
the numerical pseudo-inverse (ref. 11) of H E is 

h e - Ch‘h e ) h‘ 

whence the solution for y^ can be given as 

y E = Q E H E HQ"V (12b) 

T T 

valid for equivalent basis sequences. The operations HgHg, H*H that appear 

in this solution are often physically undefined owing to the physical units 
commonly associated with components of h vectors, but the result in equa- 
tion (12b) is numerically correct since the physically undefined operations 
appear in H*H in cancelling pairs. In fact, the product H^H is the 

well-defined matrix M of equation (11). 


The Uncorrelated Equivalent Basis Sequence 

An unoorvelated equivalent basis of the real sequence (H, Q) is any 
primary equivalent basis H E for which H^P Q H E is diagonal. The costates of 

such a sequence define a set of parameters 

(hjX, j = 1, • • • , r*} 

whose estimation errors are uncorrelated both before and after (H, Q) is 
processed. It is shown that every real sequence has such a basis and it is 
usually unique, given the real sequence and prior covariance. Further, the 
elements of the diagonal matrix state the performance in reducing the mean 
square estimation errors from their prior values specified by P Q . 
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[7] The uncorrelated equivalent basis. Every sequence has an 
uncorrelated equivalent basis given as follows: Let Hg be any equivalent 
basis of (H, Q) and let P Q be the prior covariance matrix. Define the 
matrix 


A = 


H E P o H 


E 


and let A, B be its eigenvalues and modal matrix. Then the matrix 


h eu - h e b 


is an uncorrelated equivalent basis of (H, Q) and 


H 


T p u 
EU o h EU 


A 


h eu ph eu = M 1 + A ] _1 

To prove [7], note that P Q is positive definite and symmetric, and 
that H e always exists and is an n x r* matrix of rank r*. Then A is a 
positive definite symmetric matrix of size r* . Let its eigenvalues and modal 
matrix be A, B; that is, the columns of B are the eigenvectors of A taken 
in the same order as the eigenvalues in the diagonal matrix A. The matrix B 
is orthogonal due to the symmetry of A and it then follows from statement [5] , 
that H^B (i.e., H E y) and H E are equivalent primary sequences. Therefore, 

H E B is equivalent to (H, Q) , is a basis, and makes P Q diagonal. Of the 
final two equations in statement [7] , the first is true by construction of H E y, 
and the second follows readily from equation (5) after (H, Q) is replaced by 
its equivalent (H E y, I), and is formed. 

The r* columns of H E y are costates which define r* parameters whose 
prior estimation errors are mutually uncorrelated and whose posterior errors 
are also mutually uncorrelated. This follows since, as a result of 
statement [7] 

h i P o h j = h I Ph j = 0 

for h E , hj any two distinct columns of %y. Further, after processing 
(H, Q) the MS estimation errors of the parameters defined by H E y are 
reduced by the respective factors { (1 + A^) 1 , i = 1 , . . ., r*}. 

The uncorrelated basis is a unique characteristic of a sequence (H, Q) , 
and the prior covariance, P Q . 

[8] Uniqueness of eigenvalues. Given (H, Q) and P Q , the eigenvalues of 
the matrix 

A (He) = h£p o h e 

are invariant for all primary equivalent bases, Hg, of (H, Q) . 
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Noting statement [5], all such matrices, A(H E ), are related by 
orthogonal transformations and, therefore, have identical eigenvalues. 

[9] Uniqueness of the uncorrelated equivalent basis. Given (H, Q) and 
P Q , the uncorrelated equivalent basis Hgy defined by [7] has as many unique 
columns, except for ordering and sign, as A has distinct eigenvalues. 

Distinct eigenvalues are those which appear in A only once. The 
occurrence of multiple eigenvalues is accidental so that normally H E u is 
unique except for ordering and sign. To prove [9], let H El , Hg 2 be distinct 
equivalent bases of (H, Q) . These are related by some orthogonal matrix C 
(statement [5]) so that 

A(H E1 ) = C t A(H E2 )C 

If Bi and B 2 are the modal matrices of AfHgj) and A(H E2 ), respectively, 
then 

A = B{CA(H E2 )CB 1 = B 2 A(H E2 )B 2 

Therefore, both B 2 and CB^ are modal matrices of A(Hg 2 ) • If the eigen- 
values A are distinct, the modal matrix of A(Hg 2 ) is unique except for the 
signs of its columns. If signs are ignored, 

B 2 = CBi 


then 


h EU 2 | H E2 B 2 " H E2 CB 1 “ H E1 B 1 = H EUl 


that is, the uncorrelated bases obtained from Hgj, H E2 are identical. If A 
has an eigenvalue of multiplicity s, the corresponding s eigenvectors can 
be any orthogonal basis of the same s-dimensional eigenspace. In that case, 
s columns of H E Ui and H E y 2 need not be identical, although they are 
equivalent, while the remaining columns are identical. 

In the next section, the uncorrelated equivalent basis gives the 
performance with which (H, Q) reduces estimation errors. 


Observability and Estimation Performance 

Performance, in this section, refers to the reduction of MS estimation 
errors obtained from processing a sequence. The analysis is based on the 
decomposition of the state and costate (dual of x) spaces into subspaces 
termed observable and unobservable to the sequence. Estimation errors and the 
covariance matrix are similarly decomposed, and the effect of processing the 
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sequence is to reduce only the observable errors, the amount of reduction 
being stated by the uncorrelated basis sequence and its eigenvalues. 

Decomposition of K and X- The n-dimensional costate space K and 
state space X are partitioned into the following subspaces whose definitions 
are based on the costates, h, of the measurements to be processed and on the 
(nonsingular) prior covariance, P Q : 


Xm = JC{H} 

(13a) 

X u = {x: x T H = 0} 

(13b) 

5^ h {h: H T P Q h = 0} 

(13c) 

Xm = ^{P 0 H> 

(13d) 


The subspace lC m is the r*-dimensional column space of H - the space 
spanned by the measurement costates - and is termed the observable parameter 
space. The subspace Xu is an (n - r*) -dimensional subspace of X, termed 
the unobservable states. These two subspaces are properties of only the 
measurement costates and are well known in discussions of observability (e.g., 
ref. 7) . 

The remaining two subspaces JC U and Xm are termed the unobservable 
parameters and observable states, respectively , and are defined from the prior 
covariance as well as the costates of the measurement sequence. 

The subspaces defined in equations (13) provide a complete decomposition 
of 1C and X since they satisfy 

!C = l{m©JCu 

X = X m © X u X m n x u = (0} 

In that case any costate h or state x can be uniquely decomposed into a 
sum 

h = h m + h u 

X = x m + X u 

whose parts are, respectively , in or X m , X u . Further, if H m , are 

two matrices whose columns are bases of respectively, then related 

bases of X m , X u are given by the columns of the matrices 

X m = PoV X u e P q H u (14) 
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The decomposition of K and X defined in equations (13) is "orthogonal” in 
the sense that the inner products 

Ol>h 2 >= h ]P 0 h 2 

Oi,x 2 > = x{p-1 X2 

are zero for all pairs (hi, h 2 ) or (x^, x 2 ) having one member in 5f m (or x m ) 
and the other in 3f u (or X u ) • For the inner product on 3C, a "distance" 

>^h7^ is, physically, the RMS prior estimation error of the parameter 
defined by h. Two costates are orthogonal 


<Chi ,h2> = 0 

provided the prior estimation errors of the two parameters defined by h]_ and 
h 2 are uncorrelated. Thus, the estimation error for any parameter whose 
costate is in is uncorrelated with the estimation error for any parameter 

whose costate is in including the measured parameters, and, hence, cannot 

be reduced by processing the measurements. 

T T 

The usual inner products, h 1 h 2 , xjx 2 are often physically undefined in 
estimation problems because of the physical units associated with state 
variables. 

It may be noted that the data are independent of that part of the state 
which is in X u and depends only on that part which is in Xm- As noted 
earlier, the state can be uniquely decomposed 


x = x m + x u 


and since, by construction, all measurement costates h m are in 3^, we have, 
using equation (13b) 


Reduction of estimation errors- The state estimation error, x, can be 
uniquely decomposed into parts in X m and X u 


X = x m + x u 

If, next, H m , H u are any bases of 3^, 3^, and X m , X u are defined in 
accordance with equation (14) , then the prior covariance of x can be given 
by 


P 0 = XmtHjPolU^xJ + X u [HjP 0 H J^xj 


(15) 
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or, simply 


This equation is an identity obtained by defining the matrices 

6 = [% j H U ] and V = [x m I X u ] = P o 0 

where 0 and T are n x n nonsingular matrices by construction. Next, form 
the identity 

P 0 = P o 00" 1 Po 1 0“ T 0 T P o = V |0 T P o 0j _1 'i ,T 

which, after expanding and noting equation (13c), yields equation (15). 

Equation (15) gives P Q as a sum of two positive semidefinite matrices 
of rank r* and n - r*, respectively. This decomposition is unique in that 
the two matrices, P m , P u , are independent of the particular bases of and 
?f u chosen for the construction. Further, it can be verified that these two 
matrices are, respectively, the prior covariances of x m and x u . 

If (H, Q) is some sequence to be processed and its uncorrelated equiva- 
lent basis is now chosen as the basis of used in equation (15), the 

following is readily established. 

[10] Let H Eu , A be the uncorrelated basis and eigenvalues of (H, Q) 
and P 0 , as defined in statement [7]. Let H u be any basis of and 

define 

x m = p o h EU 

p u = PoHutHjPoHul^HjPo 
Then the prior and posterior covariances are 

P o = VA" 1 ^ + Pu 

P = X m A~ 1 [I + A ] _1 xJ + P u 

The above expansion of P is obtained by using statement [7] and equa- 
tion (15) in equation (5). As seen in statement [10], the covariance of x u , 
given by P u , is unreduced by processing (H, Q) . The prior covariance of 
x m can be written 



i=i 
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where {X^} are the columns of *m here. The processing reduces each term in 
this sum by the respective factors { (1 + Ai) 1 , i = 1, . . .,r*}. 

T 

The performance in estimating any parameter, h x is defined as 

h T Ph 


P = 


h T P 0 h 


or the ratio of posterior and prior variances of h^x. All values of p are 

in (0, 1] and performance is better for smaller values. 

A costate, h, can be given as the unique sum of its parts in 3f m and 5^ 


h = h m + h u = H Eu a + h u 

where h m has been given in terms of the basis, H E( j, of 3^. Using state- 
ment [10], one obtains for any h 

r* 


E KAiHi + *i)" 1+ h J p u h u 


p = 


i=l 


(16) 


Z C“iAi) * 

i=l 

where {ct^} are the components of a. From (16) it is apparent that 

1 ^ ^ 1 


1 + A- 


< P 


max 


1 + A- 


mm 


for heK^ii 


(17) 


p = 1 for h&Ku 

The extreme eigenvalues thus give the extremes of performance for parameters 
whose costates are in 3C m while no reduction of MS estimation error occurs 
for parameters with costates in More generally, if h is in neither 

subspace (h m and h u are both nontrivial), then performance is in the interval 

(1/1 + Amaxj 1) • 

The uncorrelated basis is equivalent to the real sequence (H, Q) in the 
sense that the same estimate and performance would be achieved if the measure- 
ments defined by (Hgy, I) were taken instead of the sequence (H, Q) . The 
eigenvalues { Ai } are the MS signal to noise ratios for the measurements 
defined by (H E y, I) where the signal is the prior estimation error for each 
parameter measured by Hgp and the MS noise is 1.0. The eigenvalues depend 
on the nature of the prior information, as given by P Q , as well as on the 
sequence being processed. If Ai >> 1 then the measurement is a good one 
relative to the signal and good performance is obtained for the parameter 
defined by the corresponding column of H E jj. The converse occurs for those 
parameters corresponding to small eigenvalues. If the eigenvalues range from 
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very small to very large then computational difficulties may occur in 
processing the sequence. Such difficulties appear to be inherent in many 
problems of practical interest* especially in orbit determination. An inter- 
esting method of alleviating such difficulties based on the size of 
eigenvalues is reported in reference 12. 

Influence of the Initial covariance on estimation- Equation (5) is used 
for sequential estimation, beginning with the initial conditions x^, P^. For 
linear systems obtained as approximations of nonlinear ones, it is assumed 
that x^ is zero and P^ is, in principle, determined from the measurements 
used to compute the reference solution of the nonlinear system. However, P^ 
is often guessed rather than calculated from the initial measurements. This 
can be done if the final estimate is approximately independent of the initial 
covariance, which occurs when the information from subsequent measurements is 
much greater than the initial information. 

The posterior information and estimate from processing (H, Q, y) are 
(from eq. (5)) 


P -1 = P^ 1 + hq _1 h t 
x = PHQ - 1 y 

The condition, = 0, is used in the second equation above. If the 
information from (H, Q) is much greater than the initial information 

hq -1 h t » p^ 1 

Then P^ has negligible influence on the estimate 

x a (HQ -1 H T ) -1 Q _1 y = H” T y E 

where (H^, I, y^) is equivalent to (H, Q, y) . This requires that the whole 
space be observable to the measurements, H, and if P^ 1 is expressed in the 
form 


P' 1 = H e (MM t )h£ 

MM^ has eigenvalues much less than 1.0. 

More generally, if the whole space is not observable then the initial 
information can be separated by selecting H m , H u to be any bases, respec- 
tively, of the observable and unobservable subspaces for the measurements, H 

p; 1 = ^[Hjp^rvj + i u 
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where 


lu - «u<WV 

The two parts of P^ 1 are the initial information matrices associated, 
respectively, with and W u and they are independent of the particular 
bases chosen for the construction. When H m is the uncorrelated basis for 
(H, Q) and P^, the posterior covariance becomes 

p 1 = HeuC 1 + A 1 ] H eu + I u 


If the eigenvalues satisfy the condition that 

>> 1 i = 1 , . . . , r* 


then 


p" 1 = h eu h|u + i u = hq _1 h t + i u 
x = [I u + HQ _ 1 H T ] _ 1 HQ -1 y 

The estimation is approximately independent of the initial information 
associated with but does depend on initial information associated with 

3C U . Nevertheless, the estimate, h T x, of any parameter in 5^ is approxi- 
mately independent of all initial information. This can be verified if we let 
Hm, H u be, respectively, any equivalent primary bases of (H, Q) and any 
square root of I u > so that 

P' 1 = HQ _1 hT + I u = [H m j H u ][H m | H U ] T 

Since both h and the measurements H are in K m , they can be given uniquely 
in terms of H m in the form 


h = H^a H = H m M 

whence 

h T x = h T PHQ -1 y = a T MQ _1 y 

This result is approximately valid for all h in and is independent of 

P A . Note that the product, a T M, is independent of the choice of equivalent 
primary basis, H m , used in the construction. 
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Secondly, if only some of the eigenvalues are much greater than 1.0 then 
the estimate, h T x, is independent of Py^ only for those parameters for which 
h is in the subspace spanned by the columns of H E y corresponding to these 
eigenvalues. 

Finally, this discussion assumes that is "reasonably" guessed so 

that at least the orders of magnitude of eigenvalues are correct. If some 
eigenvalues are small or a subspace is unobservable to the measurements, then 
errors in the guessed Py^ will affect the estimate. 


Selective Information Processing 

A given measurement sequence has many equivalent bases and one can be 
found such that some of its measurements are in any specified subspace of 5C 
while the remaining are outside that subspace. Information consisting of 
measurements of parameters in the specified subspace can therefore be 
separated from the original sequence. This procedure can also be used to 
separate the processing equations, (5), into independent lower order parts by 
extracting and deleting certain information. 

[11] Information separation. Let (H, Q) be any given measurement 
sequence and any given subspace of It, and let 

k = dim(ft^ n 5^) 


An equivalent basis sequence of (H, Q) exists having k columns which are a 
basis of n 

The proof of [11] in appendix B outlines the computation of the required 
equivalent basis; that is, of a partitioned primary basis 

([ham j Hr] , i) = (H, Q) 

in which the k columns of form a basis of 5Cy\ n while the 

remaining r* - k columns, %, are not in (r* is the rank of H) . The 

partitioned equivalent basis generated from [11] separates the information 
matrix into the sum 


HQ-'hT = U m H J m * H r hJ 

The two parts of the sum are unique and have rank k and r* - k, respectively. 

The capacity of the original sequence to reduce the estimation errors of 
parameters in is not generally isolated in the extracted measurements. 

Ham- This is seen as follows. The uncorrelated complement space of 
given by 
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Wfi = {h: h T P o H A = 0} 

where Ha is any basis of M A . The space Mg is the set of costates of all 
parameters whose prior estimation errors are uncorrelated with the prior 
estimation errors of every parameter with costate in MX. The columns of 

are in M A aiK *’ if processed, contribute nontrivial error reduction for 
parameters with costates in M A . The columns of Hr are not in MX but, if 
processed, also reduce errors for parameters whose costates are in M A except 
in the special case that the columns of Hr are in the complement space, Mg. 

On the other hand, error reduction in Mg can be prevented by extracting 
and processing only the measurements. Ham* since these are all in the comple- 
ment space of Mg. If this is done, then the posterior estimation errors for 
parameters whose costates are in MX will remain mutually uncorrelated with 
those whose costates are in Mg. 

The preceding remark can be elaborated to separate the processing 
equation into two independent lower order parts associated with M A and Mg. 
This is done by a two-part separation of (H, Q) into 

(H, Q) = [ Ham j Hr] 

Then Hr is further separated into 

Hr - [hrb i Hr] 

where HRg is a basis of f {H r} n Mg and Hr is the final remainder with 
columns outside both M A and Mg. Thus, an equivalent basis of the form 

(H, Q) = ([Ham J H rb | Hr] , i) 

is obtained. The subspace 3^ is unobservable to the measurements H R g, and 
is unobservable to Ham- The remaining measurements H R contain that 
information which causes the posterior estimation errors of parameters with 
costates in 31 a to become correlated with those of parameters having costates 
in 3%. Consequently, if is deleted from the data processing then the 

calculations (eq. (5)) can be separated into two independent parts of order 
dim (31 a) an ^ n -dim (31 a) which are associated, respectively, with the estimation 
of parameters having costates in 31 a from the measurements Ham and with the 
estimation of parameters having costates in 31g from the measurements H R g 
(see appendix B) . This separation is maintained if a series of measurement 
sequences is separated and processed in the same way. 
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Discussion 


Equivalent sequences and data are used here to obtain a reasonably 
unified view of many operations that can be applied to the processing of 
arbitrary finite sets of measurements. Equivalent sequences and data are any 
set of measurements and data (fictitious) which, when processed, give the same 
state estimate and covariance as the real sequence and data. An equivalent 
basis sequence is one which has the fewest number of measurements possible for 
an equivalent sequence, this number being the rank of the measurement costates 
of the real sequence. Basis sequences can be determined using statements [4] 
and [5] and equation (12) . Several equivalent basis sequences with special 
properties were derived (e.g., the uncorrelated equivalent basis which can be 
used to describe performance, and bases which separate the information into 
measurements which are either in or outside of any selected subspace of the 
costate space) . 

Practical applications of these analytical results are not suggested as 
this matter is beyond the scope of this report and success is difficult to 
predict without computational experimentation. Computational efficiency 
might be improved in some problems by compressing the obtained information to 
an equivalent basis sequence prior to processing in equation (5) . Equivalent 
bases similar to the uncorrelated basis are already known in the literature 
and applications to treating computationally ill-conditioned problems have 
been suggested (ref. 12). Finally, the information separation procedure can 
be used when only a few parameters are to be estimated and sufficient informa- 
tion on these parameters can be separated from the original sequence and 
processed. This procedure can also be used to separate the data processing 
equations into independent lower order parts which might prove useful in 
problems that separate naturally into nearly uncoupled parts. 

In this section the data were analyzed in a context that is, perhaps, too 
general to yield much insight into the nature of estimation. In particular, 
the measurement sequences are assumed to be arbitrary, but, in practice, mea- 
surements are often obtained by sampling one or more data types at some arbi- 
trary sequence of times. The data type, given as a time-dependent function of 
the state, is then a constraint on the measurements, and its characteristics 
are common to any sampling of the data type. For example, some states may be 
unobservable to the data types and, therefore, to any sampling. In the next 
section some elementary aspects of the analysis of observability for data 
types are considered. 


DATA TYPES 


Measurements considered in this section are restricted to samplings of 
one or more known scalar outputs or data types, of the form of equation (3), 
at some arbitrary finite sequence of times in an interval [t Q ,tf]. It is 
shown that the state and parameter spaces can be decomposed into subspaces 
termed observable and unobservable to these specific data types in a manner 
analogous to that given earlier for arbitrary measurement sequences. The 
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states which are unobservable to the data types are also unobservable to any 
sampling of the data types. If unobservable states occur the estimation 
equation (5) can be reduced in order by the dimension of the unobservable 
subspace since the problem can be reduced to one of estimating a lower order 
observable subsystem. 


Observability Characteristics of Data Types 

The measurements processed with equation (5) are restricted to samplings 
of one or more scalar data types of the form 

m(t) = h(t) T x(t Q ) t 0 < t < tp 

where the measurement costate (h(t), t 0 < t < tp} is a known, continuous 
vector function. If more than one data type is available for sampling, then 
additional vector functions {h^(t), i = 1, . . . , k} are given. 

The system is said to be completely observable on [t 0 , tp] provided the 
state can be determined from the (noise-free) output (m(t), t 0 < t < tp} 

(ref. 7). The system is completely observable on [t 0 , tp] if, and only if, 
the integral 

M(t 0 ,tp) = f tF h(t)h T (t)dt 
to 

is nonsingular or alternatively, if there exists at least one set of n times 
in [t 0 , tp] such that the matrix of measurement costates 

[h (t ! ) , . . ., h(t n )] 


is nonsingular. 

The elementary definitions and conditions for an observable system are 
somewhat more extensive than the above summary (e.g., refs. 1, 7, 8), but for 
present purposes the summary suffices. In many practical problems computa- 
tions to determine observability are difficult but this subject is beyond the 
scope of this paper. 

The concern here is with systems that may or may not be observable. If 
the data types (hi(t), i = 1, . . . , k} are available for sampling then each 
data type spans a subspace of 5C 

3f m . = £{hi(t), t Q < t < t F } (18) 

called its observable parameter space. A basis of this space is given by the 
row or column space of the matrix 
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Mi(t 0 ,tp) = / F hi(t)hT(t)dt , i = 1, . . k (19) 

t O 

The observable parameter space for the collection of data types is the direct 
sum 


3<n, = 3^! ® «Jn 2 ® • • • ® 3(^ k (20) 

which is the space spanned by the collection of bases computed from equa- 
tion (19). If 3C m is r-dimensional , then there exists a set of r times in 
[t D , tp] and corresponding data types such that the corresponding measurement 
costates are a basis of M m . Any other sequence spans a space included in 

The unobservable state space is defined from the data types by 

X u = {x: x T hi(t) =0; t D < t < tp; i = 1, . . . , k} (21a) 

If a basis, H m , of 3^ is given then X u can be determined from 

X u = {x: x T H m = 0} (21b) 

An orthogonal decomposition of K and X is completed by defining 

3^ = (h: h T P A H ra = 0} (22a) 

Xm = £{P A H m } (22b) 


Here, refers to the initial covariance used in the sequential processing. 
The matrix, P^, results from some sufficient set of initial measurements of 
unspecified type, but all subsequent measurements are assumed samplings of the 
data types. The subspaces defined in equations (22) are based partly on the 
initial statistical properties of the estimation errors as given by P^, which 
specifies those parameters whose initial estimation errors are correlated with 
the initial estimation error of any parameter that can subsequently be 
measured by sampling the data types. 

The subspaces defined by equations (20) to (22) are analogous to those 
given by (13) for arbitrary measurement sequences. It is readily verified 
that 


H ® and ^ 3C\x — 0 

so that any costate, h, can be given uniquely as a sum 

h = h m + h u 
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whose parts are, respectively, in X m and X^. Similar statements apply to 
X, X m , X u and any state x. In addition 


dim P*m] = dim[X m ] = r 
dimp^j] = dim[Xu] = n - r 

where n is the dimension of the state space. These subspaces are orthogonal 
decompositions of X and X in the sense of the inner products 

T rj 1 “ 1 

<hi,h 2 > = h x P A h 2 and<x : ,x 2 > = x*P A x 2 

Finally, related bases of the subspaces are given by 

Xm = an< ^ X u = p a^u (23) 

where H m and H u are any bases of and ftu, respectively . 


Reduction to an Observable Subsystem 

If unobservable states occur then a suitable transformation of the state 
space gives the data types in terms of a reduced set of new variables and the 
estimation equations (5) are separated into trivial and nontrivial lower 
order equations. Only the nontrivial equations need be carried in the com- 
putations. To accomplish this, define the partitioned transformation of the 
state space 

x = [x m j Xu] ( 24 ) 


in which Xm (r columns) and X u (n - r columns) are, respectively, bases of 
the subspaces X m and X u given by equations (21) and (22) . Any state can be 
given as a linear combination of the columns of X 


x 


= [*■ i 



(25) 


This gives x as the sum of two vectors, one each in X m and X u . The com- 
ponents of w m (r x 1 vector) and w u (n - r x 1 vector) are the components 
of x in X m and X u , respectively, for the particular bases of these 
subspaces chosen for the transformation, (24) . 


The costate of any measurement permitted by the data types is in 3^; 
hence, measurements can be given as 
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+ e 


y = h T x + e = h T X m w m 


so that measurements can be given as linear combinations of the r new 
variables of w m and the linear combination is defined by the r x 1 vector, 
X^h. Consequently, any measurement sequence and its data (H, Q, y) can be 
given as (M, Q, y) , where 


M = xJh 


(26) 


The initial covariance of w is obtained from the transformation 


w A = 





(X^P^Xjn )" 1 

! o 

i 


o 

| ( X I P A lx u) _1 _ 


(27) 


T -1 

The off-diagonal submatrices are zero since X u = that is, the two 

subspaces are orthogonal. The nontrivial parts of are the initial covar- 
iances of w m and w u and are represented hereafter as and W u ^. If any 

sequence of measurements and its data (H, Q, y) are now processed and the 
resulting posterior covariance P is transformed, one obtains the posterior 
covariance of w as 


W = 


OVa + mq" 1 m t )~ 1 ! o 

1 

0 i w uA 


and the estimate w is given by 

= W mA M [Q + M T W mA M]-V 


w u = 0 


Finally, if subsequent batches of measurements are processed sequentially then 
the transformation, (24), maintains the separation of the processing equations 
at every step 


w m = W m0 - ^m 0 M [Q + M^W mo M] ” 1 M T W mQ 

w m = W m0 M[Q + M T W mo M] _1 (y - M T w”) + w~ 


(28) 


W, 


U 


w, 


u 



(29) 


25 



iiiini 


Here, W m0 and w m are the prior covariance of w m and estimate of w m 
obtained from processing all earlier measurements, and (M, Q, y) is the 
current batch transformed by equation (26) . Only equation (28) describing the 
r variables w m need be carried in the computations. These are entirely 
analogous to the original processing equations, (5), and are smaller in order 
than (5) by the dimension of ?f u . 

It is usually more convenient to begin with bases of 5^ and 5^ and then 
obtain Xm and X u from these bases by using equation (23) . If this is done, 
then the new variables w m , w u can be written 

w m = (HmP A H m ) _1 H^x 

w u = C H uW' lH S x 

that is, the new variables are parameters with costates in and The 

basis, H m , can always be selected so that is the unit r x r matrix. 

If this is done then 


WmA = 1 


and M can be computed from 


M = h!,H 
m 

(This result is established by noting that M = hJp a H = (H^P A H m )H^H = H*H.) 
Thus, to carry out the processing with the reduced equations (28), it is 
necessary to calculate one matrix, H^, which is then applied to the costates 
of each measurement sequence, H, subsequently processed. 


The covariance and estimate of the original state variables, x, is 
readily recovered from W, w at any step 


p = xwx T = x m w m xj + X^xj * 


x = Xw = X m w m 


(30) 


The estimated state is always an element of X m that is, is an observable 
state. The covariance is separated in equations (30) into two parts, this 
separation being independent of the particular bases of x m , X u used in the 
transformation. The covariance of x u , given by 


p u = X u W uA X^ 
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is unreduced from its initial value by processing any measurements obtained 
from the data types. Therefore, P u is a lower bound on the MS estimation 
errors and, in particular, its diagonal elements are lower bounds on the MS 
errors to which the original state variables can be estimated from the data 
types. 

A potential source of progressive accuracy loss in calculations is 
apparent in (30) . Assume that X u is nontrivial and that the estimation 
calculations are not separated as in (28) and (29). As data are then processed, 
the matrix W m in equations (30) is reduced while P u remains fixed. This 
can result in the progressive loss of numerical significance and even the rank 
with which X m W m )(T is retained in the calculated covariance matrix. The 
estimate, x, depends on X m W m X^ and not on P u making it subject to progres- 
sively larger calculation errors. The removal of unobservable states from the 
data processing, as in equation (28), removes this source of calculation 
errors . 

An example is given in appendix D to illustrate the application of 
observability analysis described in this section. 

Discussion 

The data type refers to a scalar function of time and the initial state 
whose sampling at some arbitrary finite set of times in an interval [t Q , tp] 
provides the measurement sequence used to estimate the state. The data type 
is then a constraint on the measurements and all samplings possess common 
properties as a result of the constraint. 

One such property is observability which refers to the sufficiency of the 
data type for determination of the state from some sampling in the interval 
[t 0 , tp] . The observable parameters and unobservable states are defined from 
the data type. The initial covariance matrix is introduced and then the 
unobservable parameters and observable states can be defined. These four sub- 
spaces define an orthogonal decomposition of the parameter and state spaces 
and permit separation of the processing equations into trivial and nontrivial 
parts associated with estimating the observable and unobservable components of 
the state. If the system is not observable to the data type, the problem can 
be reduced to one of estimating the lower order observable subsystem. 

This section has considered only basic definitions and the immediate 
effects of unobservable states in the data processing; it is by no means as 
extensive as one expects the analysis of data types eventually to be. It is 
possible to distinguish between complete and total observability (ref. 7) . 

The observable space, K my is completely observable in [t Q , tp] if it is 
spanned by the data type during this interval, and is totally observable if 
it is spanned during every nontrivial subinterval of [t Qj tp] . Apparently, 
the space spanned by the data type during [t Q * tp] is totally observable, pro- 
vided the data type can be expanded in a Taylor series on [t Q , tp] , and this 
is often the case in estimation problems. In addition, only the integral con- 
dition for observability has been mentioned, but conditions in terms of 
derivatives of the data type costate are also available, by way of the duality 
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between controllability and observability, from the results in reference 13 
and elsewhere. Computations to determine the space observable to a data type 
are difficult at best in many practical problems, but the matter of computa- 
tionally efficient methods is beyond the scope of this report. Finally, the 
characteristics of the data type which influence performance when all states 
are observable are not investigated here. 


CALIBRATION ACCURACY LIMITS IN ESTIMATION PROBLEMS 


In most estimation problems, calibration measurements are carried out 
initially and later measurements are comparisons between unknowns in the prob- 
lem and calibrated quantities. It is expected that the accuracy to which the 
unknowns can be estimated from such measurements is limited by the accuracy 
of the initial calibrations. The existence of accuracy limits in an estima- 
tion problem implies the existence of corresponding unobservable states and 
parameters and suggests that observability theory can be applied to their 
analysis, as is done below in the case of the generally occurring calibration 
accuracy limits. 

Measurements are comparisons of like physical phenomena, such as two 
lengths or two masses. The basis of comparison is made unique by inter- 
national definition of unit phenomena of time, length, mass, and temperature 
termed the second, meter, kilogram, and degree Celsius. 2 For example, the 
meter is currently defined to be the distance covered by 1,650,763.73 wave- 
lengths of a certain line of a Krypton 86 source under specified conditions 
(ref. 14). Estimation problems can be regarded as exercises in comparing 
some set of physical phenomena to these internationally defined unit phenomena. 
Calibrations are defined here as comparisons of any quantity in the problem 
with the unit phenomena (e.g., measuring a distance by counting wavelengths). 
With the exception of time, which is sometimes measured with a cesium clock, 
the calibrations are usually carried out initially and are excluded from the 
data types. In these cases, the accuracy to which the state variables can be 
estimated in the international units by sampling the data types is necessarily 
limited to the accuracy set by the initial calibrations. 

The nature of the calibration accuracy limit is obvious in simple cases. 
For example, a length estimated by comparison with a meter stick can even- 
tually be estimated, after a series of such independent measurements, with 
nearly the accuracy with which the meter stick was originally calibrated. If 
the length is compared to five independently calibrated meter sticks then the 
unknown length and the lengths of each of the five meter sticks can be esti- 
mated to the accuracy of the best initial estimate of length that here, is the 
length of all five meter sticks laid end to end, or equivalently, the accuracy 
obtained by averaging five independent calibrations of one meter stick. 


2 0ther units are derived from these four (ref. 14). In orbit estimation 
only time and length units appear. 
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In both cases, sampling the data types amounts to a comparison of all 
lengths in the problem with an intermediate length standard, the length of the 
meter stick in the first case and the sum of the lengths of the five meter 
sticks in the second. The estimation accuracy for the intermediate length 
standard cannot be improved by sampling the data type. It is unobservable. 

Other problems are less obvious since calibrations of varying accuracies 
are often buried in the initial estimates of fundamental parameters in the 
problem. However, features similar to those described above can be demon- 
strated; if the data types are not calibrations, then an independent unobserv- 
able state can be shown to exist for each physical unit in the problem. For 
each such state a corresponding accuracy limit is obtained, these limits 
being the best accuracies to which any length, mass, and temperature in the 
problem have been estimated initially. The length, mass, and temperature 
which have been estimated initially to these best accuracies are the inter- 
mediate standards of length, mass, and temperature. They are the "meter 
sticks 1 ' of the problem against which the data types compare other quantities. 


Unobservable Calibration States 

If the data types are not calibrations, they are comparisons or ratios of 
unknown quantities in the problem and, hence, are nonlinear in the unknowns. 
Therefore, for the analysis of calibration accuracy the linear system 
(eqs. (1) to (3)) is assumed to be obtained by linearizing a nonlinear one 
(eq. (6)), using equations (8) and (9) and a reference solution of (6a) given 
from some set of initial measurements which include the calibrations. 

Sufficient conditions for existence of unobservable states - It is shown, 
next, that simple conditions on the nonlinear system suffice for the existence 
of unobservable states of the linearized system. These results are applied in 
the subsequent analysis of calibration accuracy. 

In the following statement, X is the n x 1 state vector, and 
{X^(t), t Q < t < tp} is the reference solution of the nonlinear state equation 
used in linearizing the equations. The data types, (mi(X), i = 1, . . . , k} 
and the components of the state equation 

x(t) = f(x(t)) 


are all assumed defined with continuous partial derivatives on the state space. 


[12] Sufficient conditions. Define the diagonal matrices 





R “I 



A = 

A Ji 


where {Ji, . . ., J n > is a set of integers and X is real and arbitrary. If 
there exists a set of integers, {Jj, . . ., J n }, for which 
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(1) f (AX) = Af(X) 

(2) mp(AX) = m^CX) i = 1, . . . , k 


for all X and A, then the state 


?o = J X A (t 0 ) 

is an unobservable (deviation) state of the linearized system. 

A detailed proof of statement [12] is given in appendix C. The 
conditions of the statement require the data types and the components of 
f(X) to be algebraically homogeneous. As will be shown, these conditions are 
met if the data types exclude calibrations. Then the integer powers to which 
any single physical unit, other than the unit of time, appears in the state 
variables are sets of integers that satisfy the conditions of statement [12] . 
These integers are readily given by inspection of the units of the state 
variables. As shown in appendix C, it follows from Euler f s theorem for 
homogeneous functions that the consequence of the first condition of 
statement [12] is that the vector function 


£(t) = J X A (t) t 0 < t < tp (31) 

is a solution of the linearized state equation and can be given as 

5(t) = <H t, t 0 )JX A (t 0 ) = <j>(t, t 0 )£ 0 
The consequence of the second condition is that the data types satisfy 

5(t) T V mi (t) = t 0 )Vm i (t) = 0 

Noting equation (21a) and equations (3) and (9) , then £ 0 is an unobservable 
state of the linearized system. 

Similar consequences occur for the nonlinear system as well. If the 
first condition is satisfied by some set of integers, then the vector function 

<Kt) = AX A (t) t 0 < t < t F (32) 

is a solution of the nonlinear state equation for all A. If, in addition, 
the second condition is satisfied by the same integers, then these solutions 
are all indistinguishable to the data types; that is, 

m i (’F(t)) = m^X^t)) i = 1, • • • , k 
and the output is identical for all solutions defined by (32) . 
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State equation and eondition one - It is next shown that the state 
equation satisfies the first condition of statement [12] for those sets of 
integers which are the integer powers to which any single physical unit 
(meter, kilogram, degree), other than the unit of time, appears in the state 
variables. 

First, a restriction on the definition of the state is necessary. The 
state, roughly, is a vector list of independent unknowns which suffice to 
define the measurements and the observed system. Every physical quantity in 
the problem is assumed estimated initially, as is implied by the existence of 
a reference solution of the state equation. However, the true value of every 
such quantity is unknown and any difference between the true and estimated 
values can affect the data. Only the international unit phenomena, whose 
values are one by definition, are regarded here as known. In many practical 
problems, some quantities, which are estimated very accurately initially, are 
treated as known, but such an approximation cannot be made here without 
eliminating the calibration accuracy limit. In addition, the state Variables 
are assumed independent (no state variable can be given as a function of the 
others) and suffice to define the measurements and the observed system (all 
unknowns in the problem are state variables or functions of state variables) . 

The components of the state vector are given in units of seconds, meters, 
kilograms, or degrees Celsius, which appear to arbitrary integer powers 
(positive, negative, or zero) 

ou Y * 6- 

[X-jJ = sec m x kgm x deg 1 i = 1, . . ., n (33) 

If one of the units, say length, is scaled by A 


m = Am T 


where A is any real number, 3 then the same state expressed in the two units 
is related by 


X f 



(34) 


If {X^(t) , X^(t), t Q < t < tp} are the same solution of the state equation 
viewed in the two units, both satisfy the state equation 


x A (t) = f(x A (t)) , x A ct) = f (x A (t)) 


3 The new unit, m ! , defines a different number of wavelengths of 
krypton 86 to be the distance unit. This changes the numerical size, or 
scale, of all distances by the factor A. No state variable can become the 
distance, or known unit, by such a change of scale. 
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The vector function, f( ), is unaffected by changes in the scale of any unit 
since all quantities requiring units are necessarily unknowns and are state 
variables or functions of state variables by definition. From this and 
equation (34) it follows that 


f(A L X A (t)) = A L f(X A (t)) (35) 

where 

Thus, the state equation satisfies statement [12] for the integers 
{ , i = 1, . • . , n} which are the powers of the length unit in the state 
variables. The same is true for the sets of integers, (y^, i = 1, . . ., n}, 
etc., corresponding to any other unit, except time, which appear in the 
state variables. 

Data types and condition tuo- In this section it is shown that the data 
types satisfy condition two of statement [12] for those integers given above 
with equation (35), provided only that the data types exclude calibrations. 

Measurements are quantitative comparisons of like physical phenomena. 

For example, radar ranging stations measure the phase shift between emitted 
and received waves, this measurement being a relation between the observed 
system and the station given by 


m(X) 


1 | Rv ~ R S 

2tt A s 


which is a comparison of the station-vehicle distance, [R v - R s | , with the 
radar wavelength, X s . All physical quantities required to define this data 
type Ry, W s , X s appear among or are functions of the state variables by 
definition of the state. 

The basis for comparing like phenomena is made unique by defining a 
single such phenomenon to be the unit. Calibrations are defined here to be 
comparisons of physical quantities with the appropriate unit phenomena. The 
values obtained from calibrations are given in units: seconds, meters, 

kilograms, or degrees Celsius. It is assumed that the state is to be esti- 
mated in these units so that calibrations must occur somewhere in the estima- 
tion procedure. Any other measurement is a ratio of like phenomena, neither 
of which is a unit. Such ratios are nonlinear in the unknowns and their 
values are dimensionless and, therefore, unaffected by any change in the size 
of units. 

Assume that calibrations occur in an initial set of measurements. Then 
the data types, (m^fX), i = 1, . . ., k}, to be sampled are dimensionless 
ratios of functions of the state variables whose values are unaffected by 
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changes in the size of units. If X, X' refer to the same state given in 
each of two sets of units that differ in size, then 

mi(X') = miCX) i = 1, . . k 

where X, X' are related by (34) for changes in the length scale, and by 
analogous relations for scale changes in any other unit. Therefore 

mi (AX) = mi(X) i = 1, . . ., k (36) 


-h Ji J 

where {J^} are the integer powers to which any single unit appears in the 
state variables and X is any real number. Thus the data types satisfy 
condition two of statement [12] for the integers given above, provided the 
data types are not calibrations. In view of equations (35) and (36), it can 
be stated that: 

[13] Statement [12] is satisfied by those sets of integers 
{J-l, i = 1, . . ., n}, which are the integer powers to which any single 
physical unit, other than the unit of time, appears in the state variables 
provided the data types are not calibrations. 


Calibration Accuracy, the Meter Sticks, and Their 
Relation to Initial Measurements 

If the data types are not calibrations then, consequent to 
statements [12] and [13] , unobservable states occur and are defined by 

S 0 = JXo, j = [“Ji J (37) 

where X 0 is the initial state of the reference trajectory and { J i , . . ., 
are the integer powers to which any single unit, other than the unit of time, 
appears in the state variables. For example, the integers {J^, . . ., J n } for 
the length unit are the powers to which the meter is raised in the components 
Xj, . . X n of the state vector. For each unit (length, mass, or tempera- 
ture) which appears in the state variables, one independent unobservable state 
S om , S 0T } is defined by (37). 

The initial state of the reference trajectory is assumed to be an 
estimate based on some arbitrary and sufficient set of initial measurements. 

It is also assumed that the error covariance matrix, P^, corresponding to 
this estimate is given and is a positive definite symmetric n x n matrix. 

A basis of the unobservable calibration states is 
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D’o^om^oxJ 


(38a) 


and a basis of its complement space is defined by 


Xjji = any basis of {X: X^P^ 1 X U = 0} 


(38b) 


Earlier terminology is continued by referring to ^m as the observable 
states, but this is correct only in the absence of independent unobservable 
states other than (38a) . The initial covariance can now be separated into 
parts associated with X m and X u : 


P A = Xm[xJ P A lx "] *£ * *u 

or simply 


It is also convenient to adopt the 

notation 





m 

n m 

Ht 

i— — | — l 





K p a 1x J 

= 

^Z-m 

^mm 

^mr 



H 

1 

^mx 

^TT 


(38c) 


in order to identify individual elements from this positive definite matrix. 

Initial estimation accuracy for any length- A length in the problem is 
any differentiable scalar function of the state l (X) which has dimensions of 
length. If the length unit is scaled by A , then the value of l (X) must also 
be scaled by X to obtain £(X ! ) in the new unit, but if any other unit is 
scaled by A, the value of £(X) is unchanged: 


where 


l(A t X) = X UX) 
l (A m X) = Z( A t X) = Z(X) 


( 39 ) 
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and the subscripts, Z, m, x, indicate that the integers {Ji, . . J n } in A 
correspond to the length, mass, or temperature unit. From Euler’s theorem 
(appendix C) for functions with the above homogeneity properties, it follows 
that 



(40a) 


valid for any length, Z(X). The gradient is understood to be evaluated at 
X 0 . Analogous statements follow for all functions, m(X) , x(X) which are any 
mass, or temperature in the problem 


x> = 




(40b) 


The accuracy to which any length is estimated from the initial measure- 
ments is its MS initial error ratio, that is. 


VZ T P A VZ _ VZ T P m VZ VZ T P U VZ 
na) " Z 2 (X 0 ) " Z 2 (X 0 ) + Z 2 (X 0 ) 

or simply 


(41) 


n(0 = Pm(^) + HuCO 


The part, n u (Z), is due to the unobservable initial errors associated 
with the calibration states. The accuracy to which Z(X) can be estimated 
from any subsequent processing of data from any noncalibration data types 
cannot exceed r\ (t} • Employing equation (40a) in the definition of P u 
(eqs. (38)), obtain 


HuCO = Pzz (42a) 

Thus, the accuracy limit is y^ and is the same for all lengths in the 

problem. In orbit estimation problems in which important lengths, such as 
radar station location, planetary distances or miss distance, are estimated 
from noncalibration data types, y^Z is the limiting accuracy obtainable. 

Analogous statements apply to all masses or temperatures in the problem 

EuC m ) ~ Eu( T ) = P TT (42b) 
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More generally, quantities, g(X), which have composite units (e.g., n^kg^degX) 
have the accuracy limit 


n u (g) = (a,e,y) 



(42c) 


Length with best initial accuracy- A length in the problem is any 
differentiable scalar function of the state Z(X) which has dimensions of 
length. The accuracy of the initial estimate of any such length is given by 


n d) = n m (Z) + u 

Since is fixed and independent of Z(X) then the length which minimizes 

n [I) is any function, Z*(X), such that 


h m (Z*) - 0 


(43) 


This requires 


VZ* T X m = 0 

By definition of X m (eq. (38b)) all vectors which satisfy this equation 
must be linear combinations of the columns of P A X u 

VZ* = P~ A \l 

Since l*(X) is a length it satisfies equation (40a) so that 

“ = [❖aX] 

and then 

vZ* -1/ 

Z*(X 0 ) = P A K^olHl + 

While equation (44a) does not give Z*(X) uniquely, it does give the gradient 
of £*(X) at X Q within the ambiguous factor, Z*(X Q ). Analogous results are 
obtained for the mass or temperature with the best initial accuracies 



^om^Zm + ^ox ^Zt) ( 44a ) 
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(44b) 


vm~ - i ✓ v 

m * (x 0 ) = P A + ^om^mm + ^ox^mx ) 

"Meter sticks" of the problem - In the absence of calibrations among the 
data types, the estimation of the state from the data types is equivalent to 
comparing all lengths in the problem to Z*(X) and all masses to m*(X). The 
quantities, £*(X), m* (X) , x*(X) are the intermediate standards or "meter 
sticks" of the problem since, for their respective physical phenomena, they 
occupy the role of a meter stick used to measure a distance. This is shown 
by the following analysis. 

If the data types exclude calibrations then unobservable calibration 
states are routinely present and can be removed from the data processing as 
described in an earlier section. This is done by transforming the deviation 
state to a new set of variables, w, defined by 


x = 



(25) 


In the present context, *m and are defined in equations (38) . The new 

* bles, w m , w u are the components of the state in ^m » Xu • The components, 
w m , can be estimated from any (sufficient) noncalibration data types 
(eqs. (28)) while the components w u cannot (eqs. (29)). 


The nature of the variables, w u , can be determined by premultiplying 
equation (25) with X^P” 1 , and inverting the result to obtain 


fVl* T x/l*a o) 
w u = I Vm* T X/m* (X Q ) 


(Vt*x/ t* (X 0 ) 


(45) 


Thus, the quantities w u which cannot be estimated are the errors, 1* > m*, 
x* with which the "meter sticks" of the problem have been estimated from the 
initial measurements. 

Rather than analyze the nature of w m directly it is convenient to 
consider that part of the initial estimation error for any length, mass, or 
temperature in the problem which can be estimated from noncalibration data 
types. The estimation error, for any length, 3(X), is given from the state 
estimation error by 


l = V£ T X 
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This can be separated into observable and unobservable parts 


l = VZ T X m w m + VZ T X u w u 


or simply 


The observable part is 


1 - l m + l u 


T'm Z - VZ X u w u 


which, noting equations (40a) and (45) , becomes 


? _ 7 Z(Xp) 

111 Z*(X 0 ) 


T 

VZ* x 


If the ratio of Z (X) to the meter stick is defined 


C46) 

then the estimation error for this ratio is 

fro - Vr T i = 

so that 

Z m = Z*(X 0 )? (47) 

Thus, for any length, Z(X), that part of the initial estimation error which 
can be estimated from samples of noncalibration data types is the error with 
which Z(X) is known relative to the meter stick, Z*(X), of the problem. 
Similar conclusions apply to all masses and temperatures in the problem. The 
estimation procedure, when using noncalibration data types, is at most one of 
comparing all lengths to Z*(X) and all masses to m*(X). 

Calibration accuracy and initial measurements- The accuracy to which any 
quantity requiring units (other than time) can be estimated without performing 
new calibrations is limited. The relation of these limits to the measurements 
performed initially is examined next. 


7 l Ml* 

L ~ l* (x Q ) 


38 



The initial measurements are those used to determine the reference 
trajectory. These have been loosely described as any arbitrary and sufficient 
set. It is further assumed that this set is finite; that a vector, Y, can be 
given which is a (finite) list of all independent physical quantities required 
to define the initial measurements; that the state variables, X, can be 
included in the variables of Y, and that the set of initial measurements is 
sufficient to determine Y. 

In the preceding statement, the independence of the variables, Y, means 
that there is no nontrivial relation 

S 00 = 0 

among these variables. Further, the vector, Y, can always be given as 



Here, X is the state vector which has appeared throughout the earlier text 
and is a list of independent quantities required to define the observed 
dynamic system and the data types. These can always be included in Y, since 
the reference initial state, X Q , could not otherwise be determined from the 
initial measurements. However, the initial measurements are not restricted to 
the data types so that additional unknown physical quantities, Z, are gener- 
ally required for their definition. Finally, the sufficiency of the initial 
measurements to determine Y means that there is at least one subset of mea- 
surements for which an inverse function exists giving all the variables, Y, in 
terms of the data. This, of course, guarantees that X Q can be determined. 

The above description of the initial measurements is sufficient for 
immediate purposes but is incomplete. A statement to insure that all the ini- 
tial measurements are relevant to the estimation of X G is lacking, as well as 
a statement on the measurement noise. 

Next, list all the initial measurements and segregate them into calibra- 
tions of length, mass, and temperature and measurements that are not such 
calibrations; that is. 


ll( Y), l 2 i V, • • • (49a) 

refer to the initial length calibrations, 

mi (Y), m 2 (Y), . . . , tjCY), t 2 (Y), . . . (49b) 

are the mass and temperature calibrations, and finally, 

n x (Y), n 2 (Y), . . . (49c) 
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are the remaining measurements. The measurements of this last category are 
dimensionless in length, mass, and temperature. 


The measurements in equations (49) are required to be sufficient to 
determine Y; that is, some subset of (49) has an inverse function which 
gives Y in terms of the data. Let H^(Y) be the matrix whose columns are 
the gradients of equations (49) evaluated at Y, and let be the noise 

matrix. Take the requirement for the sufficiency of equations (49) to mean 
that Hj\(Y) has maximal rank at every Y, and, for simplicity, assume that all 
initial measurements are made with independent noise so that Qa is diagonal. 
Then the initial measurements can be processed to obtain the minimum variance 
estimate Y 0 and the initial covariance is given by 


E[yy T ] = [haQa 1 ^] 


(50) 


where H A is evaluated at Y 0 and y is the estimation error Y - Y 0 . 

The columns of H a can be partitioned into two parts that contain the 
gradients of equations (49) with respect to X and Z, respectively, and these 
can be further partitioned according to the category of measurements 


II 

% 

_ 

HXZ ! 

T 

H Xm ! 
1 

%T_ 



_ H z_ 


1 tc 

INI 

H Zm | 

H Zt 

DC 

tsi 

2 

1 


The initial covariance of X, which was written as P A and assumed given 
throughout the earlier text, is the appropriate submatrix from equation (50), 
and is readily given as 


= E[XXT] = [hxQa'hT - H^-Q” 1 [h z Qa 1 Hg J ' (52) 


When all the initial measurements are functions of X only, then 

P A * 

T - 1 

In this case, it follows by using equations (40) that the matrix, X^P^ X u , of 
equation (38c) is diagonal and yields the calibration accuracy limits: 







TT 


(£■%) 


( 53 ) 


The summations in equation (53) are taken over the appropriate category of 
initial calibrations and {q^} are the MS measurement noise for these 
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measurements. The meter sticks for this case are given by evaluating 
equation (44). Thus, for length. 



and it can be verified that 


i*m = l 0 




satisfies the gradient equation, with L 0 an arbitrary length factor. Any 
length proportional to the weighted combination, ^ (£i/qi)£i(X) , is estimated 
initially to the accuracy, y 77 . Analogous results are obtained for m*(X) 
and t*(X). 

The quantities, y^, y^, y TT , in equations (53) depend only on the 
accuracies of the initial calibrations. They are smaller in magnitude than 
the MS error ratio of any single calibration if more than one calibration is 
made. If only one quantity is calibrated initially for a particular unit, 
then it is the unobservable meter stick for that unit. If more than one is 
calibrated, then none of them is the meter stick, although the most accurately 
calibrated one may dominate the appropriate sum in equations (53) and be the 
approximate meter stick for its unit. 

In the preceding special case a direct relation between calibration 
accuracy limits and initial calibrations is obtained. However, the condition 
that the initial measurements be functions of X only is not generally sat- 
isfied and the quantities actually calibrated may have little to do with the 
variables of interest, X. For example, in orbit determination problems the 
actual distance calibrations are buried in the multitude of experiments from 
which the value of the speed of light is estimated. This value is, in turn, 
used in the initial estimates of such things as radar station location and 
frequency, gravity field constants. In such cases all the initial measure- 
ments can influence the accuracy limits, y 7y , y^, y . These limits are the 

r t -1 1-1 ^ ^ lL 

diagonal elements of X*P. X u which, for the general case, can be written 
as L u j 

[xjp-'xj" 1 = [d - RVR] 1 (54) 

where X u are the calibration states for X (eq. (38a)), and D is the 
diagonal matrix 

X) ^ (Y 0 )/ 

D = 


E^Ofol/qi 
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Also, I z and R are 


i z - «zq>I 


R - [ H z l^l 1 \ HzmQi'i | H zt Q;' ? ] 

The matrix R has one column for each unit other than time; H z ^, H zt 

are the partitions of H z in equation (51); Q^, Qm, Q x are the corresponding 
partitions of the noise matrix, and l , m, t are the vectors 



The elements of the matrix on the left in equation (54) appear in the earlier 
results (eqs . (42), (44)) as ]i u> y^, y^ m , .... All the initial 
measurements can appear in these elements because of the term, R T I Z 1 R. 


Finally, it can be shown from equation (54) that the calibration accuracy 
limits satisfy 


Hi 



(55) 


and similarly for y mm , y TT - The equal sign is removed whenever R has no 
trivial column; that is, whenever any initial calibration has nontrivial 
dependence on the variables of Z. 


Appendix D contains a simple example to illustrate the use of the above 
results . 


Clock Calibration, the Velocity Standard, and 
Estimation of the Speed of Light 

The time calibration state was ignored in the earlier discussion. The 
internationally defined second is based on transitions between energy levels 
in Cesium-133 (ref. 14). If a cesium clock is used, all time measurements are 
calibrations and an unobservable time does not occur in the problem. 

Assume, then, that a clock different from a cesium clock is in use and 
that when the clock indicates the passage of one clock second, r seconds on a 
cesium clock (standard time) have actually passed. Also assume that 
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r = 1 second and r = 0 


(56) 


In general the clock rate will drift (r ^ 0) but it is assumed here that the 
drift is negligible over the time interval of the observations. 

The clock rate, r, is analogous to the length of a meter stick in that a 
reading of the clock time, x, is a dimensionless comparison 


(57) 


of the standard time, t, with the clock rate, r, in the same way that length 
measurements using a meter stick of length, l , is a dimensionless comparison 
of two lengths 


m 


l 


~ l* 


The state equation 


at = ™ c 6a ) 

governs the evolution of the state as seen in standard time. As seen in 
clock time, the state satisfies 


dX 

dx 


rf (X) 


the state can now be augmented with the unknown clock rate 


dy _ _d_ 
dx dx 



Arf (X) N 

V o J 


= g(y) 


The time calibration state- As before, assume that the time unit is 
scaled arbitrarily 


sec = X sec ! 


Then all unknowns, as seen in the new unit, are scaled accordingly 


v' 

= X -1 v 

frequencies and velocities 

r' 

= Xr 

clock rate 

T ' 

= X 

time-dimensionless quantities 
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whence 


Y' = AtY 


A t = [Vi J 

{Ji> = powers of the time unit in variables of Y 


As before, it is readily shown (noting that the independent variable, t, is 
unaffected by the scale change) that 

A T £(Y) = g(A T Y) 

Assume, also, that all measurements to be processed are time dimensionless, so 
that 


mi(A T Y) = mi(Y) i =• 1, 2, . . . 
Then, using statement [12], page 29, the following state 

Z 0 = j t y 0 


(58) 


(59) 


where 


Y„ = 


x o(t 0 ) 


M 

^Xo(t 0 ) ^ 

reference initial state 


is an unobservable state (the time calibration state) of the linearized 
augmented system. 

The linearized augmented state equation- This equation can be routinely 
derived as 


where 


% - coyy 




Here* F(X 0 ) is the coefficient matrix of the linearized equation in standard 
time. 

An alternate derivation can be made by defining the state deviation 
5(t) = X(t(x)) - Xq(t) = X(t) - X D (x) + X 0 (x)(t(x) - t) + . . . 
or 

5 00 = x(x) + X D (x) x6r + . . . (61) 

where x(x) is the usual definition of the state deviation (X(x) - X 0 (x)), 
which, to first order, is a solution of 

§7 = F(x 0 (x))x 

Equation (60) is now readily reproduced by taking the T-derivative of 
equation (61). Thus, the augmented linearized equation describes the time 
history of £(x) (eq. (61)) and includes the effect of the secularly 
increasing discrepancy between standard time and clock time. 

Linearized measurements - If a data type, M(Y) , is sampled the linearized 
data used in the linearized estimation procedure can be routinely expressed as 

6m = ( vm )y o ( t )XCt) + e 
or 

6m = h(x)^y(t 0 ) + e 

where 

h(x) = 4 > T (T,t 0 ) ( VM )y o (t) 

An alternate derivation can be given as follows. Suppose a data type, M(X), 
is sampled at the clock time, x. The sampling obtained is then 
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m = M(x(t(x))) + e 

That is, it is a (noisy) sampling of M(X) on the actual state at the standard 
time corresponding to the clock time, t. The residual or linearized sampling 
is then computed by subtracting the value of M(X) evaluated on the reference 
state at the time t = x. 


6m = M(x(t(x))) - M(X 0 (t)) + e 

which, after expanding X(t(x)) and retaining first-order terms, is 


6m = VM^Ct) + e 


where £(t) is the state deviation defined in (61). It is not necessary to 
include the clock reading as an additional measurement in the problem, since 
any random reading error, e r , can be included with the sampling error, e, in 
the amount, VMTx o (x)e r . 

Velocity calibration accuracy- If all data types are dimensionless in 
distance and time, an unobservable velocity exists which serves as the stan- 
dard against which all velocities in the problem are compared. Its initial 
estimation accuracy is the lower bound on the accuracy to which any velocity 
can be estimated from any sampling of the data types. 

This estimation accuracy limit is given as 

'V = Hi ~ 2V U + P TX C 62 ) 

where y 77 , y 7 , y are the elements of the matrix 

u U L- T T T 


ll 

) 

■ DfoV 

r 

It 

W 




Y u 

- [SjEJ 




- -Vo 

distance calibration state 


2, 

- h y o 

time calibration state 


P A 


initial covariance matrix 
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Equation (62) for the velocity accuracy limit is obtained by applying 
equation (42c) . It can also be shown that the velocity standard in the 
problem, v*, is proportional to l*/t* where l *, t* are the distance and 
time meter sticks of the problem. 

Speed of light estimation in space mission problems- Space probe 
trajectory estimation problems have been treated as experiments to estimate a 
number of fundamental parameters of the solar system. One expects that a 
necessary condition for significant improvements in the estimates of such 
parameters is that their initial estimation accuracies tie significantly poorer 
than the appropriate calibration accuracies in the experiment. 


For example, consider the possibility of improving the estimate of the 
speed of light in a space probe trajectory estimation problem using radar 
measurements. The independent unknown quantities required to define the data 
types can be listed, in general, as 


Y = 



Speed of light 
Clock rate 
Vehicle state 

Radar station locations and frequencies 


Earth rotation rate and gravity field parameters 



Planet states and gravity field 
parameters 
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The initial measurements include an experiment to estimate the speed of light 


mj = d = cAt 
m 2 = At 


This experiment consists of measuring a distance, d, and the time. At, 
required for light to travel this distance. Both measurements are calibra- 
tions and the experiment {mi, m 2 } is equivalent to a large number of similar 
experiments from which the current estimate of the speed of light is obtained. 
The initial estimation accuracy for the speed of light is then 


^c 



°L2 


At' 


where q , q , are noise variances of the two calibrations. Currently, the 

1 2 . _ g 

speed of light is estimated to one part in 300,000 or /(q c ) = 0.3x10 

All remaining initial measurements are assumed to be functions of Y 
and these may include additional distance and time calibrations 


l H™ 

± 1 = 1 ,. 


ti T 00 

i T = • 

. ., n t 


The complete set of initial measurements depends on At as well as the 
variables of Y. In that case, temporarily augment the state to 



and then the initial covariance of Y' is given as 

p A U') = [h a Qa 1 h a] 



c 

1 | 0 ... 0 
0 | 






Ate 




-1 

1! 

£ 

0 

1 

0 1 
1 

and Q a = 


«2 



_ 0 

o ! 




• — 
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It is assumed that the rank of H ^ is equal to the number of variables in Y 1 . 


The variable. At, is irrelevant to the estimation problem since the data 
types are independent of it (they are functions of Y) . Neither is it of 
interest to estimate At via the correlations of its initial estimation error 
with those of the remaining unknowns. In that case. At, can be removed from 
the estimation equations as described earlier and, in particular, equa- 
tion (54) is applied to obtain the calibration accuracies of the reduced 
problem. The result for the velocity accuracy limit is (lengthy algebra is 
omitted here) 


^v = 





(63) 


where 



The summations in a y are taken, respectively, over all initial length and 
time calibrations except those in the speed of light experiment {mi, m 2 }. It 
is apparent from the result that 

1. If the speed of light experiment had not occurred then the velocity 
accuracy limit would be a v . Conversely, if no calibrations other than 

{mi, m 2 } had occurred, then the velocity accuracy limit would be q c and the 
speed of light would be the unobservable velocity standard. 

2. In general, n v < q c and it is theoretically possible to improve the 
accuracy of the speed of light estimate to ri v by sampling noncalibration 
data types of the form m(Y). 

3. A necessary condition for significant improvement in the speed of 
light estimate by sampling noncalibration data types, m(Y), is, roughly, that 


% 


o 


v 


> 


1 


that is, that the velocity calibration accuracy due to the remaining 
calibrations, {l±}, {t^} be at least as good as the accuracy of the initial 
speed of light estimate, q c - 

4. In practice, it is usually true that 
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so that negligible improvement can be expected and the speed of light is very 
nearly unobservable in space vehicle trajectory estimation problems. 


Discussion 

An elementary view of estimation is taken here to analyze the naturally 
occurring calibration limit on the obtainable estimation accuracy. Estima- 
tion, in general, is a comparison of some set of unknown quantities with the 
internationally defined unit quantities: the second, meter, kilogram, and 

degree Celsius. A direct comparison of some quantities in the problem with 
the unit quantities (calibrations) must occur at some point in the estimation 
procedure. In most cases this is done initially and subsequent measurements 
are comparisons among unknown quantities. In that case, the accuracy to 
which the unknowns can be estimated in terms of the international units is 
limited by the accuracy of the initial calibrations. For example, the accu- 
racy with which a length can be estimated by comparing it to a meter stick is 
limited by the accuracy with which the meter stick was calibrated initially. 

Estimation problems are generally analogous to the above simple example 
even though several data types or measuring devices, many initial calibra- 
tions, and other physical units besides the unit of length may appear in the 
problem. If the data types are not calibrations, then an independent 
unobservable state is readily given, one for each physical unit in the problem 
(statements [12] and [13] and eq. (59)). These states define an intermediate 
standard of comparison, or meter stick (eq. (44)), and a calibration accuracy 
(eq. (42)) for each physical unit (time, distance, mass, and temperature) 
having the best initial estimation accuracy in the problem. The procedure of 
estimating the unknowns by sampling noncalibration data types is, at most, one 
of comparing all times, lengths, masses, and temperatures in the problem to 
these meter sticks. 

The initial covariance and the calibration accuracies can be given in 
terms of the initial measurements (eq s. (49) to (52)). In the simplest case, 
the accuracy limits depend only on the initial calibration measurements 
(eq. (53)) but, in general, these limits can depend on all initial measure- 
ments (eq. (54)). An example is given in the discussion of the velocity 
accuracy limit in space probe trajectory estimation (eq. (63)). 

Unobservable calibration states are theoretically present whenever the 
measurements are not calibrations, but these states are often eliminated in 
practice by treating some quantities, which have been estimated very accu- 
rately initially as exactly known quantities. For the theory of this report, 
the values of all physical quantities in a problem are considered unknowns; 
that is, all are estimated initially but their true values are unknown and 
their estimation errors can affect the data. The state variables to be esti- 
mated are then any set of independent unknowns which suffice to define all 
unknown quantities in the problem. In practice, variables which are very 
accurately estimated initially are often deleted from the state variables; 
that is, they are treated as having no estimation error. It is assumed that 
the inclusion of such variables in the estimation would have negligible effect 
on the estimates of the remaining state variables. As a result of this 
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procedure, the sufficient conditions for the existence of unobservable 
calibration states (statement [12]) are no longer satisfied by the problem; 
the required homogeneity properties of the state equation are eliminated and 
the data types can become calibrations or comparisons with exactly known 
quantities. The resulting removal of the lower bounds on estimation accuracy 
causes no difficulty in practice provided reported accuracies do not approach 
the actual calibration accuracy limit. It is, of course, not possible to 
exceed this limit without performing new calibrations. The size of the cali- 
bration accuracy limit is given from the initial covariance (eq. (42)) and can 
be estimated if some reasonably correct idea of the initial measurements is 
available. 

Generally, a necessary condition for significant improvement in the 
estimate of any parameter is that its initial estimation accuracy be signifi- 
cantly poorer than the appropriate calibration accuracy limit. At best, the 
parameter can be estimated to nearly the calibration accuracy limit by indef- 
inite sampling of the data types. However, the existence of unobservable 
states in addition to the calibration states can impose further limits on the 
accuracy to which a parameter can be estimated. 


RESUME 


Mathematically, estimation is a matter of processing noisy measurements 
to obtain the best estimate of the unknown quantities on which the measure- 
ments depend. The elements which appear in the processing equation for linear 
estimation (eqs. (5)) are the prior estimate and covariance and the measure- 
ments (costates, MS noise, and data). The processing results in a new esti- 
mate of the state and its covariance of estimation errors. This report 
describes some results from a study of the relations between the elements of 
the processing equation and the resulting estimate and performance. 

In the first section, the processing of arbitrary sequences of measure- 
ments is examined using the notion of equivalent sequences and data. This 
notion provides a basis for carrying out a number of operations in the pro- 
cessing of measurements. A sequence can be compressed to an equivalent 
sequence containing a minimum number of measurements. The uncorrelated equiv- 
alent basis sequence and its associated eigenvalues give the performance of 
the sequence in reducing the estimation errors from their values prior to 
processing. Further, equivalent sequences can be used to selectively reduce 
the information to be processed so as to estimate only some of the state vari- 
ables, or to separate the processing equations into independent lower order 
parts. 

In many estimation problems, the measurements are obtained by sampling 
one or more data types at an arbitrary sequence of times. The characteristics 
of the data types are then constraints on any such sampling. One character- 
istic is observability, the sufficiency of the data types for the determina- 
tion of the state variables. If the data types are insufficient, then a 
suitable transformation of the state gives the data types in terms of a 
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reduced set of new variables which are linear combinations of the original 
variables and which can be determined from some sampling of the data type. 

This transformation also separates the estimation calculations into trivial 
and nontrivial parts, the latter being a lower order set of equations for 
estimating the reduced set of observable unknowns. 

A theory of calibration accuracy limits in estimation problems is 
obtained by applying observability analysis and recognizing the physical 
nature of estimation. Estimation is intended to compare a physical system 
with the internationally defined physical units and this is done by taking 
measurements which, in general, are comparisons of like physical phenomena. 

If the measurements are not calibrations (not direct comparisons with the unit 
physical phenomena) , then the estimation procedure is shown to be one of com- 
paring the system to a set of intermediate standards of time, length, mass, or 
temperature. These intermediate standards are unobservable and their initial 
estimation accuracies (calibration accuracies) limit the accuracy to which any 
time, length, mass, or temperature in the problem can be estimated without 
performing new calibrations. Further, to obtain significant improvement in 
the estimate of any unknown, it is necessary that its initial estimation accu- 
racy be significantly poorer than the appropriate calibration accuracy. The 
existence of independent unobservable parameters in addition to the 
intermediate standards will further limit the obtainable estimation accuracy. 

In principle the state is either observable to the data types or the 
estimation problem can be reduced to a lower order problem in which a reduced 
number of variables are observable. Computations to determine the observable 
parameter space and carry out the reduction are based on the information 
integral (eq. (19)). An application of this method is found in reference 12. 
The necessary computations are difficult, especially in large-order problems. 
Alternative methods based on derivatives of the data type (ref. 13) are avail- 
able but not yet explored. The structure of relations between data and per- 
formance in the case that all states are observable remains unclarified. In 
many orbit determination problems using radar data it is recognized that one 
or more parameters may be observable but computationally difficult to estimate 
(e.g., refs. 4, 5, 12) because the data type provides little information on 
these parameters compared to others. Computationally, such cases can be 
treated with special procedures (e.g., refs. 5, 12). 


Ames Research Center 

National Aeronautics and Space Administration 

Moffett Field, Calif., 94035, January 29, 1971 
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APPENDIX A 


COMPUTATION OF AN EQUIVALENT BASIS SEQUENCE 
Given an information matrix 

If = hq _ 1 h t 

obtain an equivalent basis; that is, a matrix, of maximal rank such that 

H E He = If 

This problem is identical to the computation of a square root matrix of 
maximal rank for any symmetric positive semidefinite matrix. The method 
reviewed here is also described in reference 10. 


COMPUTATIONAL METHOD 


Let B be any n x n symmetric positive semidefinite matrix and let 
{v. } be the columns of any nonsingular n x n matrix. Define the matrices 
{Bj} as 


Bi = B, 

B, 

k+i 

Wk B k 

B k 

v k B k v k 

if 

; I B k"k * 

0 

(Al) 



B k 

if 

\Vk = 

0 



and define the vectors {c^} as 



if v^B k v k * 0 
if v^B k v k = 0 


(A2) 


The matrix whose columns are the nontrivial vectors among {c^, k = 1, . . . , n} 
is a square root of B having maximal rank. 
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This result is proved in the following series of statements . 

[1] The matrices Bj, B 2 , . • . are all symmetric positive semidefinite. 
This follows from induction since the statement is true for B 2 and if it is 
assumed true for B k it follows that it is true for B^ +1 . 

[2] The matrix B n+1 is zero. 


Bn+ l-O 


The null space of B k+ i includes (a) the vector, v k and (b) the null space 
of B k . This can be verified from the definition (Al) above; it follows 
immediately that B k+1 v k = 0 and that if v is any vector for which B k v = 0 
then also B k+1 v = 0. Therefore, the null space of B k+1 includes the vec- 
tors v l5 v 2 > . . . , v k and, in particular, the null space of B n+ i includes 
all n independent vectors, (v k ), and is zero. 

[3] Let C be the matrix whose columns are the nontrivial vectors among 
{c k , k = 1, . . ., n}. C is a square root matrix of B 


B = CC T 


(A3) 


Employing statement [2] and the definition of and C obtain 

nr rp 

0 = B n+ 2 = B n - C n C n = . . . = B ^ ~ c k c k = B - CC 

k=l 

[4] The nontrivial vectors among (c k , k = 1, . . . , n} are independent 
and C has maximal rank. 

To prove this, show that there is no nontrivial linear combination of the 
nontrivial (c k ) which is zero. Let 

E s k^k = 0 ( A4 ) 

k=l 


be any linear combination which is zero and consider the dot product of vj, 
with this sum of vectors 


n 


E 

k=l 



for nontrivial 


c k we have from (A2) 


(A5) 
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I 


-T- 


V l C k 


^T B k^k 

,-T - vl/2 

( V k B k v k) 


But, as pointed out in statement [2], the vector vj is in the null space of 
all {B k , k > 2} and all B k are symmetric positive semidefinite so that 



k > 2 


(A6) 


and equation (A5) becomes 


s l v l c l = 0 


Thus, if c x is nontrivial, then si is zero and equation (A4) becomes 


X 


k=2 


s k c k = 


0 


(A7) 


These steps are repeated for v 2 , forming the dot product of v 2 with 
equation (A7) and obtaining s 2 = 0 if c 2 is nontrivial. When repeated in 
succession for v 3 , v 4 , . . . , v n , the {sj} are zero for all nontrivial cj . 
Consequently, there is no nontrivial linear combination of the nontrivial 
{c R } which is zero. These vectors are, therefore, independent and the matrix 
C has maximal rank. 

In summary, the matrix C whose columns are the nontrivial {c^} is an 
equivalent basis of any sequence whose information matrix is B. 


CHOICE OF THE VECTORS {v } 

K 

The vectors {v k } can be any n independent vectors. The columns of the 
unit matrix are utilized in reference 10 and this choice places the resulting 
square root in lower triangular form. 

Alternatively, the {v k } can be selected to remove beforehand those which 
yield trivial vectors in equation (A2) . This can be done by obtaining a 
partitioned matrix 

V = [V u j V m ] 

in which the columns of V u are any basis of the solution space of 

Bv = 0 (A8) 
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and the columns of V m are any completion to a set of n independent vectors * 
for example, a basis of the solution space of 

T- 

V = 0 

It is readily seen that the number of columns of V m is the rank of B, and 
that all columns of V u give trivial vectors, (J^, when processed in equa- 
tions (Al) and (A2) . In that case, only the columns of V m need be processed 
to obtain the square root of B and these all yield nontrivial vectors in 
equation (A2) . 

Insensitivity to computational error is an important factor in the choice 
of computational procedure for this problem, but this question is not 
addressed here. 
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APPENDIX B 


INFORMATION SEPARATION 
SEPARATION LEMMA 


Let (H, Q) be a measurement sequence and the column space of H. 

Let W A be any arbitrary subspace of "K and 

dim (pC A n^) = k 


Then an equivalent basis sequence of (H, Q) exists which has k elements that 
are a basis of 3C A n 

The equivalent bases of (H, Q) are all the square roots of maximal rank 
of its information matrix. All such square roots are bases of the same space, 
^ . The statement to be proved asserts that for any specified subspace of 

a square root can be found, part of which is a basis of the specified sub- 
space. Such a separated equivalent basis is not unique for the specified 
subspace, but it separates the information matrix uniquely. 



The sketch visualizes the lemma. 
Here, K is the whole costate space and 
includes both 3C A and 2^. A parti- 
tioned equivalent primary basis 

(H, Q) = [Ham | H R ] 

is to be found for which H^f is a 
basis of 3C a n and Hr refers to the 
remaining columns which are a basis of 
some subspace of outside 


PROOF 


The following proof outlines the required computational procedure. 

1. Let H a , H m be, respectively, any basis of K A and an equivalent 
primary basis of (H, Q) . The first is assumed given as the definition of JC A 
and the second is computed by procedures already given and always exists. 
Assume, also, that H m has r columns. 

2. Compute any basis, H^y^, of the subspace ^AM* The intersection, 
^AM> is always a linear vector space so that a basis exists and can be 
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computed as follows. Let \ HgJ be any completed basis of the whole space, 

5C, which contains and express H m in terms of this basis 


H ra = [h A 

where the coefficient matrix, 
define 




computed by inverting 


(Bl). 


Then 


(Bl) 


Di e any basis of (d: Bd = 0} 


(B2) 


and then 


HaM 2 = %Di = H a AD! (B3) 

Every column of is simultaneously in M A and every column is 

independent, and every vector which is both in 5C A and can be given in 
terms of Consequently, Hy^ is a basis of My^. 

3. Next, define the partitioned orthogonal r x r matrix 

[D i S] 

where D is obtained as the Gram-Schmidt orthogonalization of the columns of 
Di, and S is any orthonormal basis of { s : D^s = 0}. Then define 

H* = H m [D | S] (B4) 

Since [D j S] is orthogonal then 


K* H 


m 


(B5) 


and the submatrix 


^AM - ^m^ 


is a basis of My^, while 


Hr E H m S 

is such that 3Cr n = o. Consequently, 


(B6) 
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HrS = [ h AM i Hr] 


(B7) 


is the required partitioned equivalent basis of (H, Q) . The corresponding 
equivalent data for the fictitious measurements defined by the columns of (B7) 
is given from equation (12) of the text. 

DISCUSSION 


1. The separation procedure is trivial in the case of sequential 

processing of single measurements. A single measurement is either in or 
outside of so that one part of the separated basis is the measurement 

itself and the other part is null. 

2. Given the sequence, (H, Q) , and the subspace, the information 

matrix is uniquely separated by the above procedure 

HQ _1 H T = + HrHr (B8) 

The corresponding parts of any two partitioned equivalent bases are also 
equivalent since they are constructed to span the same subspaces (statement [6] 
of the text) . 

3. The separation states that the equivalent basis can be selected so 
that some of its measurements are in any specified subspace and the remainder 
are outside that subspace. This permits selective processing of information. 
The available information is fixed by the actual sequence of measurements but 
this information can be reduced by deleting or delaying the processing of any 
measurement (s) from the actual sequence or from any equivalent sequence. 
Finally, the separation permits removal of measurements within or outside any 
selected subspace by deleting from processing either of the terms in (B8) . 

4. The influence of each of the separated terms in (B8) on performance 
(reduction of MS estimation errors from their prior values) is considered 
next. Suppose Ka is some given subspace of K and let 5% be its 
uncorrelated complement space 

5% = {h: h T P 0 H A = 0} (B9) 

P Q being the prior covariance. If only the term. Ham* is processed then 
there will be nontrivial reduction of MS errors for every parameter whose 
costate is not in the uncorrelated complement space of ^aM* This includes 
l^AM and ma y or may not include all of ^a* There is no error reduction for 
any parameter whose costate is in the complement space of &AM anc * this 
includes ICg. 
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If only the measurements Hr are processed, then there is nontrivial 
error reduction for some parameters with costates in Kg since every column 
of Hr is outside K A . In addition, there is nontrivial error reduction for 
some parameters with costates in K A except in the special case that every 
column of Hr is in Kg. Thus, the capacity of the original sequence to 
reduce errors in K A is not generally isolated in the extracted measurements, 

h AM- 


As noted above , if only the extracted measurements, H^j, are processed 
then there is no error reduction for parameters with costates in ?Cg. Further, 
the posterior estimation errors for parameters with costates in are 

mutually uncorrelated with those in 3fg so that the complement space of 
is the same set of parameter costates for both the prior and posterior 
covariances. Consequently, the complement space of ^ remains fixed if a 
succession of measurement sequences are treated in the same way. It is 
possible, therefore, to prevent error reduction on any given subspace, 2%, by 
selectively processing only information in its complement space, and the 

complement space need be determined only once. 


SEPARATION OF ESTIMATION PROBLEMS INTO INDEPENDENT PARTS 


The estimation calculations can be forced to separate into independent 
parts by selective removal of some information from each sequence that is 
processed. The data processing then separates into two lower order problems. 

Let P^ be the initial covariance (MS estimation errors of the 
reference trajectory), let be any given set of parameters with basis, H^, 

whence its initial uncorrelated complement space is 


Kg = {h: h T P A H A = 0} (BIO) 

The two subspaces, 5^, Kg form a complete decomposition of the whole costate 
space and will remain fixed throughout the data processing. 

Suppose that the following two-part separation is made on every 
measurement sequence (H, Q) to be processed 

W, Q) * K | H rb j H r ] (BID 

where j is the first separation of (H, Q) into a basis of and 

a remainder H R ; and jjt^g j is a separation of H R into a basis of 

f ! 

l JCg with H r as the remainder from this second separation. If H R is 

removed from every sequence to be processed then is unobservable to the 

measurements H R g, and 5Cg is unobservable to H^, so that the estimation 
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calculations can be decomposed into two lower order problems associated with 
each of the two subspaces. Some formulas for the separated problem are given 
next. Algebra is largely straightforward and omitted. 

Let Hg be any bases of ICg, respectively, and define the 

corresponding subspaces of the state space with bases 


X A - P A H A 

► 

X B = P A H B , 

Together these form a transformation of the state space 


CB12) 


- [*A 


X B ]w - x A w A + X B W B 


(B13) 


The new variables, w, are the components of the state, x, in the basis 

[X A j XgJ , and these are linear combinations of the parameters of and Hg, 


since 


W A 




h I p a h a 


T 

H h 


h b p a h b 


k -l 


h b x 


Transforming the state estimation error, x, into the new variables, w, obtain 
the initial covariance of w as 


W 0 



( h a p a h a) _1 

1 

O 

1 i 

O | 

1 

1 -1 
] ( h J p a h b) _ 


(B14) 


The nontrivial parts of W Q are the initial covariances of w^, Wg , and are 
written Wq W gg hereafter. 

If one measurement sequence is separated as in (Bll) and only Hgg 

are processed then the posterior covariance of w is 





« * m a m I ) 1 

1 — 


T 1 

O 

1 


1 0 

1 

w = 

1 

1 

= 

— — — — . 

1 

1 

! / -i T \-i 


_o ! W B _ 


0 

! ( W 0B * H B H b) J 


(BIS) 
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where, for convenience, the notation 


M A = - H T A PfH m ' 

% * X B H RB " H b P A H RB , 


has been used. 


The estimate, w, is given by 

"A “ “oa“a[i * M I W OA M a] ly A 
*B ' "oB^t 1 ♦ M B K OB M b]‘V b 




> 


/ 


(B16) 


(B17) 


Here, y^, y g are the equivalent data vectors corresponding to the measure- 
ments H^j, H R g, respectively, and which are assumed computed with the separa- 
tion of (H, Q) in accordance with equation (12) of the text. If additional 
measurement sequences are separated as in (Bll) and only H^j, H^g are 

processed from each sequence, then the processing equations remain separated, 
being the same as (B15) and (B17) except to replace, Wq^, W Q g with the prior 
covariances and to replace w^, Wg with the changes in estimates 


Q A~ "~A > *B " * B 

Thus, by removing from each sequence, the calculations for 

estimating x can be separated into two independent lower order problems for 
estimating w wg. The processing equations (B15) and (B17) are entirely 
analogous to the original equations (5) in the text. 

Finally, the estimate, x, and covariance, E [xx^] , for the original state 
variables are recovered at any step from 


x = 
P = 


X A W A + 


B B 


Wa * x b"b x b 


(B18) 


The separation above is trivial for sequential processing of single 
measurements and is intended only for treatment or analysis of one or more 
batches of measurements. The neglected measurements, Hr, are generally non- 
trivial and provide nontrivial error reduction for some parameters in both 
and Jfg, since all the columns of are neither in ^ nor &g by con- 

struction. Further, the information content of all the neglected measurements 
will differ depending on the subdivision of the total set of measurements into 
batches for separation. Although the choice of subspace, can be made 


62 



arbitrarily, the importance of the neglected measurements, H^, to error 
reduction in is expected to depend on how well the choice of 

corresponds to some naturally occurring subdivision of the problem into nearly 
independent parts. Such a natural subdivision, when it exists, would be 
inherent in the problem; that is, in the observed dynamic system and in the 
data types employed for measurements, but this matter is beyond the present 
scope which deals with processing arbitrary measurement sequences. 
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APPENDIX C 


SUFFICIENT CONDITIONS FOR UNOBSERVABLE STATES 


The sufficient conditions and corresponding unobservable states derived 
here are the basis of discussion in the text of the proposition that if data 
types exclude calibrations, then the physical scale of the problem cannot be 
determined to better accuracy than it was known a priori , and this limits the 
accuracy to which the state variables can be determined from the data types. 

The statement to be proved here is that unobservable states exist if the 
state equation and data types possess certain homogeneity properties. These 
properties occur naturally in many problems as follows: the state variables 

have arbitrary units; compatibility requires that the components of the state 
equation be homogeneous functions, the consequence of which is that a class 
of solutions related to the reference trajectory can be given. These solu- 
tions can be regarded either as distinct or as the same solution viewed with 
different scales for the physical units of length, mass, and temperature. If 
calibrations are excluded the data types are also homogeneous functions which 
are unchanged by any change in the scale of physical units. Hence identical 
data are generated by any of the trajectories mentioned above. 

The homogeneity properties given below suffice for the existence of 
trajectories which are indistinguishable to the data types. The corresponding 
result for the linearized problem is obtained by applying Euler* s theorem for 
homogeneous functions to determine appropriate unobservable states. 


HOMOGENEOUS FUNCTIONS 


In the following, f(X) is any scalar function of the n x 1 state vector 
defined on an open set, S, in X. Homogeneous functions are usually defined 
as follows (ref. 15, p. 134). 

Definition . f(X) is homogeneous of degree p on S provided 

f (AX) = A P f(X) (Cl) 


for all x in S and all real A such that XX is in S. 

It is necessary to generalize this definition in order to study homogeneity 
properties related to scale changes in physical units. 

Definition. f(X) is homogeneous of degree p for the integers 
(J i , . . J n ) provided 
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where 


£ (AX) = A P f(X) 


A = f' X Ji J (C2) 

for all X in S and all real A such that AX is in S. 

The integers in this definition can be positive, negative, or zero. The 
definition (C2) includes (Cl) as a special case. 


INDISTINGUISHABLE TRAJECTORIES 

In the following, {m^(X), i = 1, . . . , k} are the data types, and the 
state equation is 


X(t) = f(X(t)) (6) 

The data types and the components of f are all assumed defined in an open 
set, S, in X and to have continuous partial derivatives. 

Two solutions of the state equation {Xj (t) , X^Ct), t Q < t < tp} are 
indistinguishable provided they generate identical data; that is, provided 

m i (x 1 (t)) = m i (X 2 (t)) i = 1, • • k; t Q < t < t F (C3) 

Indistinguishable trajectories can arise as follows. Let 
{X Q (t) , t 0 < t < tp} be any solution of (6). 

If there exists a set of integers { J j , . . ., J n } such that 



(1) 

f(AX) 

= Af (X) | 

1 


(C4) 


(2) 

mi (AX) 

= mi (X) i i - 1> • • • > k J 



(where A 

= l> Ji J 

) , for 

all X and S and all real X 

such that 

AX 

is in S 

then 

<Kt) = 

AX 0 (t) t Q < t < tp 


(C5) 


is a solution of (6) for any allowed A and all such solutions are 
indistinguishable . 
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The conditions of (C4) require the components of f to be homogeneous, 
respectively, of degree J , . . . , J n and the data types to be homogeneous 
of degree zero, all for the same set of integers. As discussed in the text, 
such integers occur naturally if we let {J^} be the integer powers to which 
any single physical unit (e.g., meter) other than the unit of time appears in 
the state variables, and if the data types exclude calibrations. 

That (C5) defines solutions of the state equation follows from the first 
condition of (C4) 

<Kt) = AX 0 (t) = Af(X 0 (t)) = f(AX 0 (t)) = f(«Kt)> 

That these solutions are indistinguishable follows from the second condition 
in (C4) . 

The corresponding result for the linearized problem is obtained by 
applying Euler's theorem for homogeneous functions to show that the conditions 
in (C4) are sufficient to define unobservable states; that is* initial states* 
£ 0 , which define linearized trajectories {<£>(t, t 0 )£ 0 ; t Q < t < tp} on which 
the linearized data types, VmT$(t* t 0 )£ 0 are zero at all times. 


EULER'S THEOREM 


For the usual definition of homogeneous functions* (Cl)* Euler's theorem 
states (ref. 15* p. 134): 

If f (x) is homogeneous of degree p and differentiable at X* then 

Vf(x) T X = pf (X) (C6) 


The proof is accomplished by taking the derivative of both sides of (Cl) with 
respect to A and evaluating at A = 1. The validity of these steps is 
assured by the differentiability of f(x). 

Some difficulties of basic definition must be overcome before an 
appropriately modified Euler theorem for generalized homogeneity can be given. 
The fundamental notions of analysis involved in (C6) - neighborhoods* open 
sets, continuity, and differentiability (ref. 15) - are ordinarily defined on 
the assumption that an absolute value, |x| is defined on X. However* | X j is 
undefined here owing to the arbitrary units permitted the state variables in 
this paper. 

To bypass these difficulties a norm il Xll is now assumed defined on X and 
then the usual definitions of analysis are generalized by replacing jxj with 
II X il wherever necessary. In particular, the appropriate generalization of 
differentiability is (e.g., ref. 16, p. 172): 
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A scalar function, f (X) , is Frechet-differentiable at X provided that 
for every h in x there exists a function, df(X; h) , such that 


lim ( f(X + h) ~ fffl " dfCx;h) l = 0 (C7) 

|| h II ^ 0 1 11 h 11 1 

If f (X) is Frechet-differentiable then (ref. 16) 

df(x;h) = f(x) T h (C8) 

and the usual chain rule applies. 

The norm, II X II , can be any quadratic form 

II X II = /x t mx 


where M is, numerically, positive definite. Nothing is specified about M 
other than its existence, but it may be noted that if f(X) is continuous, 
differentiable, etc., for one such norm, the same is true for all such norms. 
For example, the matrix was used in the text when the elements of x 

were interpreted as state estimation errors, x. The introduction of II X II is 
merely a convenience and nothing is actually added, topologically, to the 
context by its introduction. 

A modified Euler theorem can now be given: 

If f(X) is homogeneous of degree p for the integers { J i , . . J n } 
and Frechet-differentiable at X then 

vf (X) T 5 = pf(X) 


where 


5 = 



(C9) 


The proof is again obtained by taking the derivative of both sides of (C2) 
with respect to A and evaluating at A = 1. The validity of these steps is 
assured by the assumed Frechet-diff erentiability of f(X). 

SUFFICIENT CONDITIONS FOR UNOBSERVABLE STATES 

In the following {X^(t), t Q < t < tp} is the reference trajectory, and 
{mp(X), i = 1, . . . , k} and the components of f(X) are all assumed defined 
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with continuous partials on an open set S in X. Continuous partials are 
sufficient to insure Frechet-differentiability. 


If there is a set of integers {Ji, . . J n } for which 


(1) f (AX) = Af (X) 

(2) mi (AX) = rnifX) i = 1, . . k 


(CIO) 


for all X in S and all real X such that AX is in S, then the 
state 

So = [ji]x A (to) (Cll) 

is unobservable. 

To prove this, let f 1 > • • • » ^ n be the components of f and observe that 
the first condition in (CIO) can be written as 

f i (AX) = A J if i (X); i = 1, . . n 

Apply Euler's theorem (C9) to these components to obtain 


Vfi(X) 


[ Ji Jx = [ dijfcx) 


(C12) 


For convenience, define 


5(t) = [di Jx A (t) 

and evaluate (C12) on the reference trajectory, {X A (t), t Q < t < tp} 


(C13) 


IvfT 


s(t) = [di ]f(x A ct)) = [* Ji Jx A (t) = set) 


- J x A (t) 
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Thus, £(t) satisfies the linearized state equation, whence 


5 (t) = *(t, t„) 5 (t 0 ) (04) 

Next, apply Euler's theorem to the data types, which are homogeneous of degree 
zero for the integers {Ji, . . ., Jn)* 

Vm i (X) T £ J ± Jx = 0 i = 1, . . . , k 

Evaluating this on the reference trajectory, obtain 

v mi(XA(t)) T £(t) = h i (t) T 5(t 0 ) = 0 i = 1, . . . , k; t 0 < t < tp (05) 

This result gives 5(t 0 ) as an unobservable state. As usual the symbol hi(t) 
refers to 4>T(t, t 0 ) Vmi(XA(t)) . 
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APPENDIX D 


EXAMPLE - UNACCELERATED MOTION 


The discussion of observability is illustrated in the simple example of 
this section. An unaccelerated particle is under observation from an 
inertial ly fixed radar station measuring range. Assume an inertial coordinate 
frame with origin at the station. 

An unaccelerated particle moves in a straight line with constant velocity 
so that its motion is given by 


R(t) = R 0 + V Q tj 
V(t) = V 0 I 


0 ^ t < 00 


(Dl) 


where R 0 , V 0 are the unknown initial position and velocity of the particle. 

The measurements are samplings of the phase shift between transmitted and 
received signals, which can be represented as the ratio 


m(t) = j R ( t ) \/X 


(D2) 


where A is the radar wavelength. 

The state is a list of all independent quantities required to define (Dl) 
and (D2) 


X(t) = 



(D3) 


and contains seven independent variables. The object is to estimate the 
initial state, X 0 , of the system. 


OBSERVABLE PARAMETER SPACE 


In this problem the general solution of the state equation is already 
known and linear, so the data type can be given directly in terms of the 
unknown initial state 

m(t) = j R 0 + V 0 tjA 
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The measurement costate is the gradient of m(t) with respect to the variables 
of X 0 


h(t) = 

A|R(t)| 


C-l/A) (Rq + 2^ 0 t + V 2 t 2 ) 
Rn + Lt 


R 0 t + V Q t 2 


(D4) 


where R, R 0 , V 0 are magnitudes of the vectors R, R 0 , and V Q , and the 
superscript (") indicates quantities evaluted on the reference initial state, 
X 0 , which is assumed calculated from some initial set of measurements. 


The observable parameter space is the space spanned by the measurement 
costate during the interval of observation which is taken here as 


Jfo = £ {h(t) , 0 < t < 


If the measurement costate is given analytically , as in this example, then a 
basis of can be obtained by expanding h(t) in the form 


h (t) = a 1 (t)hi + . . . + aj c (t)hj c 0 < t < 00 


that is, in a linear combination of fixed vectors, hi, h 2 > . . . , h^. Then 
{hi, . . ., h^} is a basis of 2^ provided they are independent vectors and 
provided 04 (t), . . a^(t) are linearly independent time functions over the 

interval of observation. 

The measurement costate for this example (eq. (D4) ) can be written as a 
sum of terms in powers of t as 


h(t) = 


R(t) 



V 0 R 0 


R(t) 


•2V^R 0 /XR 0 V 0 > 

v 0 /R 0 v 0 
R 0 / R o v o 


t 2 vs 


R(t) A 



(D5) 


The superscript (") is understood for all quantities in equation (D5) . The 
coefficient time functions R 0 /R(t), tR 0 V 0 /AR(t), and t 2 Vo/AR(t) have been 
nondimensionalized . These functions are linearly independent on every non- 
trivial time interval. Further, the three fixed vectors in (D5) are 
independent so that the columns of 
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(D6) 


- 


'-R 0 A 2 

R 0 / AR 0 
0 


-2vJr 0 /AR 0 V o 

V 0 /V 0 R 0 

Rq/ r o v o 


-1/A" 

0 

V 0 /Vo. 


form a basis of 3^ and 


dim (3C m ) = 3 


(D7) 


It may be noted that h(t) spans the same observable space regardless of the 
interval of observation. 


In view of (D7) only three independent parameters (functions of X Q ) 
describing the motion of an unaccelerated vehicle can be estimated from 
measurements by an unaccelerated radar ranging station. These parameters can 
be identified by noting that the columns of H m are the gradients of 


PiCXo) 


P 2 (X Q ) 

P 3 (Xo) 



which, respectively, are the initial distance in wavelengths, the dot product 
of initial position and velocity, and the speed in wavelengths per second. 


UNOBSERVABLE STATES 


The unobservable state space for this problem is four-dimensional since 


dim(x u )= n - dim (3^) =7-3 = 4 


These states are the initial states that define trajectories for which the 
linearized output is zero; that is, they are the solutions of 

H m*u = 0 

A basis of X u is given by any four independent solutions of (D8) , with H m 
given in equation (D6) . For example 
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0 


r a o o 


X u 


R 0 R 0 P 0 pxR 0 



V oP P x ^o J x 


CD9) 


Here, p is the unit vector perpendicular to the plane of R Q , V 0 . It can be 
verified that the columns of (D9) satisfy (D8) and are independent. The first 
column of (D9) is recognized as the unobservable calibration state, which is 
known to occur in the present problem since the data type is not a calibra- 
tion. This state can also be given easily by applying statement [13] of the 
text. 

The unobservable states of the linearized problem usually correspond to 
indistinguishable trajectories of the original nonlinear problem. These are 
solutions of the state equation for which the output time history is identical 
to that of the reference solution. The calibration state corresponds to 
uniform scaling of all distances, including the radar wavelength; that is, 
the initial states X^, X Q generate identical output if 


t ^ 

X Q = aX G , a arbitrary positive 


The remaining three unobservable states in (D9) correspond to trajectories 
for which the station wavelength and the range time-history are, separately, 
equal to that of the reference trajectory. This occurs when the initial range, 
R 0 , initial speed, V Q , and the angle, <(R Q , V Q ) , are the same as their refer- 
ence values. Three arbitrary choices define this set of indistinguishable 
trajectories, two to locate the initial position vector and one to locate the 
velocity vector. The last three columns of (D9) represent three analogous 
independent choices for which the linearized range time-history is identical. 


INITIAL MEASUREMENTS AND CALIBRATION ACCURACY 


The analysis of calibration accuracy is illustrated now. The data type 
of this example is not a calibration so that the problem is calibrated (com- 
pared to the meter) by some initial set of measurements which are also assumed 
sufficient to determine the reference trajectory and initial covariance of the 
linearized estimation problem. 

An example set of initial measurements is given next. This set is 
intended to be simple rather than closely resemble what might occur in 
practice. It just suffices to uniquely determine a reference trajectory and 
contains just one calibration. 

The radar wavelength is assumed determined from the relation 

A = c/2iTf (DIO) 
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and from measurements of the radar frequency f, and a determination of the 
speed of light by measuring a distance d, and the time At, it takes light to 
travel this distance. The first three measurements are then 


m^ = d = 2irAf At \ 


m 2 = At > 

m 3 = f / 


(Dlla) 


The measurement, m^, is the only distance calibration to appear in the initial 
measurements . The calibrated distance, d, is given above_ in terms of other 
unknowns by using equation (DIO) and the relation c = d/At. In practice, the 
current value of the speed of light results from many experiments similar to 
{m ls m 2 }. 


To complete the determination of the reference initial state, the radar 
is used to measure range and antenna pointing angles (right ascension and 
declination) at two times, t = 0 and t = tj . The clock is assumed started 
simultaneously with the first measurement. These measurements are 


m^ = cos ^//x 2 + y 2 
m 5 = sin 1 z/R 0 
m 6 = R 0 /A 
m7 = tj 

mg = cos 1 (x + xti)//(x + xti) 2 + (y + yti) 2 
m 9 = sin - 1 (z + zt i ) / | R 0 + | 

m 10 = l^o + V 0 tl l/ A 


(Dllb) 


Here, x, y, . . . , z are the Cartesian components of R 0 , V 0 . All these 
measurements are assumed made with independent gaussian error having zero 
mean and MS values, q x , . . ., q 1Q . These measurements can be defined in 
terms of 10 independent unknowns 
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Y = 


At 

f 

t l 

X 

*o 

V 


(D12) 


These are partitioned above into the state variables, X Q , of the main problem 
and three additional variables, Z, which are required to define all the 
initial measurements but are irrelevant to the main problem. 

The costates of the a priori, measurements (their gradients with respect 
to Y) are given, in order, by the columns of 


h a = 


d/At 

1 

0 

1 

1 


I 0 

1 

1 

0 

0 

0 


d/f 

0 

1 

1 

1 


1 0 

1 

1 

0 

0 

0 


0 

0 

0 

1 

1 


1 

1 1 

1 

1 


VqMi 



[ H z] 

— 

— 



1 

— 

J 


— 



= 


d/X 

0 

0 

1 0 

0 

-R 0 /X 2 | 0 

1 

1 

0 

0 

-RiA 2 


LHxJ 




1 

1 

1 

M 0 

1 

I 0 
1 

1 

1 


Ml 



_ 

<t> 


1 

1 

1 


! o 
1 

1 

1 

1 


tiMi 




(D13) 


Here, the 3x3 submatrix, M 0 , is 


Mq = 


pxu Q 


v^ 


+ y* 


JL 


u 


where p is the unit vector perpendicular to plane of R 0 , V 0 , and u Q is 
R 0 /R 0 . The matrix. Mi, has the same definition except that quantities are 
evaluated at time, 1 1 . 

It can be shown that is nonsingular so that the 10 measurements, 

mi (Y) , . . ., mio(Y), can be inverted to obtain a unique solution for Y in 
terms of the data. This contains the required^solution for X Q . The initial 
covariance, P^, associated with the estimate, X Q , is given by the formula 
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(e[xx t ]) _1 = h^q ; 1 - 


(D14) 


P 


-1 

A 


A 

where quantities are evaluated at Y and is the diagonal matrix containing 
q 1 , . . . , q 1Q . The calibration accuracy is then given from 

Hi = («oi p i l 5 ot )' 1 

where g - is the calibration state from equation (D9) . (Alternatively, 
can be computed from eq. (54).) Omitting lengthy but routine algebra, the 
calibration accuracy is 


Hi 



(D15) 


It is recognized that y ^ is the accuracy with which A can be estimated 
from the measurements {mi, m2, m3}. This accuracy is poorer than that of the 
actual calibration, q^d 2 . 

The equation for the gradient of the meter stick of the problem is 


hi* 

£*(X 0 ) 


y ZZ P A 5 oZ 



This does not uniquely specify £*(X 0 ), but any length function of X Q which 
has this gradient can be taken as the meter stick; for example 

7*(X) = X (D16) 

In this example, the meter stick has a natural interpretation as the radar 
wavelength. If the speed of light is determined from redundant experiments of 
the same kind as {mi, m2} then the meter stick is again X and the calibra- 
tion accuracy is again the accuracy with which X can be estimated from these 
initial experiments. However, if the problem is a more realistic one (e.g., 
that of estimating particle motion in a gravitational field from an earth- 
based radar range station), then the meter stick will not generally be X nor 
have a simple physical interpretation. 

The estimation of the particle trajectory in this example is, at best, a 
process of comparing all distances to the radar wavelength, and the accuracy 
to which any distance can be estimated is limited by the accuracy with which 
the wavelength is estimated initially. There are, however, unobservable 
states in addition to the calibration state, so that a lower bound on 
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estimation accuracy, due to all the unobservable states, occurs and is poorer 
than This lower bound can be determined by separating into two 

parts associated, respectively, with the observable and unobservable states. 

If desired, this separation can be carried out by the procedure described in 
the text (eq. (30)), and then a lower bound on the accuracy to which the state 
variables can be estimated from the range data is given by the diagonal 
elements of the part associated with the unobservable states. 
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